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To all whom it may concern: 

Be it known that, \ye, 

Arnold E. Hampel and Richard H. Tritz 
have invented certain new and useful improvements in 
RNA CATALYST FOR CLEAVING SPECIFIC RNA SEQUENCES 
of which the following is a full, clear and exact description. 
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RNA CATALYST FOR CLEAVING SPECIFIC RNA SEQUENCES 

This application is a continuation-in-part of 
co-pending application Serial No. 07/577,658 filed 
September 4, 1990, which was a continuation-in-part of 
application Serial No. 07/409,666 filed September 20, 
1989, which was a continuation-in-part of application 
Serial No, 07/247,100 filed September 20, 1988, now 
abandoned . 

This invention was made in part with Government 
support under Grant No, DMB 8817576 awarded by the 
National Science Foundation and Grant No. ROl AI 29870 
awarded by the National Institutes of Health. The 
Government has certain rights in the invention. 

FIELD OF THE INVENTION 

The present invention relates to an RNA 
catalyst which cleaves specific RNA sequences into a 
fragment having a 5' hydroxyl and a fragment having a 
2' ,3' cyclic phosphate. The products of the reaction 
described herein resemble those resulting from the 
natural hydrolysis of RNA. 

BACKGROUND OF THE INVENTION 

Certain naturally occurring satellite, virusoid 
and viroid RNAs possess the property of self -catalyzed 
cleavage. Self -cleavage has been demonstrated in vitro 
for avocado sunblotch viroid (ASBV) (Hutchins, C. J., 
Rathjen, P. D., Forster, A. C. and Symons, R. H. (1986) 
Nucleic Acids Res. , 14: 3627-3640), satellite RNA from 
tobacco ringspot virus (sTRSV) (Prody, G. A. , Bakos, 



- 1 - 



- 2 - 

J. T., Buzayan, J. M. , Schneider, I. R. and Bruening, G. 
(1986) Science . 231 : 1577-1580; Buzayan, J. M. , Gerlach, 
W. L. and Bruening, G. B. (1986) Proc. Natl. Acad. Sci. 
U.S.A. Q3 ; 8859-8862) and lucerne transient streak virus 
(vLTSV) (Forster, A. C. and Symons, R. H. (1987) Cell . 
49 : 211-220). These Belf -catalyzed RNA cleavage 
reactions share a requirement for divalent metal ions and 
neutral or higher pH and cleave target RNA sequences to 
give 5' hydroxyl and 2' ,3' -cyclic phosphate termini 
(Prody, G. A,, Bakos, J* T., Buzayan, J, M. , Schneider, 
I. R. and Bruening, G. (1986) Science . 2?1 : 1577-1580; 
Forster, A. C. and Symons, R. H. (1987) Cell . £2.: 211- 
220; Epstein, L. M. and Gall, J- G. (1987) Cell . Ifi: 535- 
543; Buzayan, J. M. Gerlach, W. L., Bruening, G. B., 
Keese, P. and Gould, A. R. (1986) Virology , HI: 186- 
199) . 

A "hammerhead" model has been proposed and 
accurately describes the catalytic center of (+)sTRSV 
RNA, the (+) and (-) strands of ASBV and the ( + ) and (-) 
strands of vLTSV (Forster, A. C. and Symons, R. H. (1987) 
Cell . 4£: 211-220). The single exception is (-)sTRSV RNA 
which does not fit the " hammerhead " model (Forster, A. C. 
and Symons, R, H, (1987) Cell . 49 : 211-220; Buzayan, 
J.. M. , Gerlach, W. L..and Bruening, G. (1986) Nature . 
323 : 349-352; Buzayan, J. M. , Hampel, A, and Bruening, 
G. B. (1986) Nucleic Acids Res. . i£: 9729-9743), and the 
structure of whose catalytic center was unknown prior to 
the present invention. It is therefore understandable 
that the primary scientific focus has been on studying 
the "hammerhead" consensus structure and, as regards 
sTRSV, on studying the (+) strand. 

Intermolecular cleavage of an RNA substrate by 
an RNA catalyst that fits the " hammerhead w model was 
first shown in 1987 (Uhlenbeck, 0. C. (1987) Nature . 221: 
596-600). The RNA catalyst was recovered and reacted 
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with multiple RNA molecules, demonstrating that it was 
truly catalytic. 

Catalytic RNAs designed based on the "hammer- 
head" motif have been used to cleave specific target 
sequences by making appropriate base changes in the 
catalytic RNA to maintain necessary base pairing with the 
target sequences (Haseloff and Gerlach, Nature . 334 . 585 
(1988); Walbot and Bruening, Nature , 22£, 196 (1988); 
Uhlenbeck, 0. C. (1987) Nature , 328 : 596-600; Koizumi, 
M. , Iwai, S. and Ohtsuka, E. (1988) FEBS Lett , , £28: 228- 
230) . This has allowed use of the catalytic RNA to 
cleave specific target sequences and indicates that 
catalytic RNAs designed according to the "hammerhead" 
model may possibly cleave specific substrate RNAs 
in vivo . (see Haseloff and Gerlach, Nature . 334 . 585 
(1988); Walbot and Bruening, Nature . 334 . 196 (1988); 
Uhlenbeck, 0, C. (1987) Nature . 328 : 596-600). 

However, catalytic RNAs such as those that were 
designed based on the "hammerhead" model have several 
limitations which restrict their use in vitro and may 
forestall their use in vivo . For example, the 
temperature optimum for the reaction is 50-55°C, which is 
well above physiological, and the kcat (turnover number) 
is only 0.5/min even at 55°C (Uhlenbeck, O. C. (1987) 
Nature . 328 :596-600: Haseloff and Gerlach. Nature . 334 . 
585 (1988)). In addition, the Km is 0.6uM (Uhlenbeck, O. 
C. (1987) Nature , 12&:596-600) , meaning that the reaction 
requires high concentrations of substrate which makes it 
difficult, if not impossible, for the catalytic RNA to 
cleave low levels of target RNA substrate such as would 
be encountered in vivo . 

Cech et al. published application WO 88/04300 
and U.S. Patent No. 4,987,071 also report the preparation 
and use of certain synthetic ribozymes that have several 
activities, including endoribonuclease activity. The 
design of these ribozymes is based on the properties of 
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the Tetrahvmena ribosomal RNA self -splicing reaction. A 
temperature optimum of 50°C is reported (page 39 of WO 
88/04300; col. 20, lines 4-5, of U.S. Patent No. 
4,987,071) for the endoribonuclease activity, and the Km 
and kcat reported for this activity are 0.8uM and 
0.13/minute, respectively (Example VI, last paragraph) . 

In view of the above, there is a need for an 
RNA catalyst having a lower temperature optimum, prefer- 
ably near physiological temperatures, a higher turnover 
number and a smaller Km and which can be engineered to 
cut specific target RNA substrates. Accordingly, based 
on the discovery of a totally different structure 
disclosed hereinafter, it is an object of the present 
invention to provide such an RNA catalyst. Other objects 
and features of' the invention will be in part apparent 
and in part pointed out. The invention, accordingly, 
comprises the products and methods hereinafter described 
and their equivalents, the scope of the invention being 
indicated in the appended claims. 

SUMMARY OF THE INVENTION 

The invention comprises a synthetic RNA cata- 
lyst capable of cleaving an RNA substrate which contains 
the sequence: 

5' -F r CS-F 2 -3' , 

wherein, 

CS is a cleavage sequence; and 

Fj and F 2 each is a sequence of bases flanking 
the cleavage sequence. 

The catalyst comprises a substrate binding 
portion and a "hairpin" portion. The substrate binding 
portion of the catalyst has the sequence: 
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3' -F 4 -L,-F 3 -5' 

wherein, . j Q 

F 3 is a sequence of bases selected so that F 3 is 

substantially base paired with F, when the catalyst is 

bound to the substrate; 

F 4 is a sequence of bases selected so that F 4 is 
substantially base paired with F, when the catalyst is 

bound to the substrate; 

the sequences of F 3 and F 4 being selected so 
that each contains an adequate number of bases to achieve 
sufficient binding of the RNA substrate to the RNA cata- 
lyst so that cleavage of the substrate can take place; 

Ll is a sequence of bases selected so that L, 
does not base pair with CS when the catalyst is bound to 

the substrate. 

The "hairpin" portion is a portion of the 
catalyst that assumes a hairpin-like configuration when 
the substrate-catalyst complex is modeled in two dimen- 
sions for minimum energy folding. The "hairpin" portxon 
of the catalyst preferably has the sequence: 

.P 3 -S 2 -P 4 -3' 

wherein, , 
P, and P 4 each is a sequence of bases, the 

sequences of P, and P 4 being selected so that P, and P 4 are 
substantially base paired; 

P, is covalently attached to F 4 ; 

Sl and S a each is a sequence of bases, the 
sequences of 8, and S, being selected so that 8, and S, are 
substantially unpaired; 



P 2 and P 3 each is a sequence of bases, the 
sequences of P 2 and P, being selected so that P 2 and P 3 are 
substantially base paired; and 

Lj is a sequence of unpaired bases. 

RNA catalysts according to the invention can 
cleave substrates of any length or type as long as they 
contain an appropriate cleavage sequence. In particular, 
the catalysts can be used to cleave a specific sequence 
in naturally- occurring RNA having a cleavage sequence, as 
well as RNAs which have been engineered to contain a 

cleavage sequence . 

The invention further comprises an engineered 
DNA molecule and a vector, each of which comprises a DNA 
sequence that codes for an RNA catalyst according to the 
invention. The invention also comprises a host 
transformed with the vector, the host being capable of 
expressing the RNA catalyst. In particular, hosts can be 
transformed with vectors that, when transcribed, will 
produce RNA catalysts which can cleave any RNA, native or 
foreign, found in the host. For example, hosts can be 
transformed with vectors that, when transcribed, produce 
RNA catalysts which can regulate the expression of genes 
by cleaving messenger RNA or which act as ant i- viral 
agents by cleaving viral RNA. Thus, the invention has 
application in vitro and in vivo in prokaryotes and 
eukaryotes of plant or animal origin in regulating gene 
expression and for controlling viral infections. 

Finally, the invention includes a method of 
cleaving an RNA substrate comprising contacting the 
substrate with an RNA catalyst according to the inven- 
tion. The reaction is unique because it occurs under 
physiological conditions, having a temperature optimum 
near 37°C, with very favorable reaction parameters. The 
method can be practiced in vitro or in viy g. For 
instance, the method may be practiced in vivo in host 
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cells that have been transformed with a vector that codes 
for an RNA catalyst according to the invention, 

BRIEF DESCRIPTION O P THE DRAWINGS 

Figure 1 shows the (-)sTRSV RNA substrate - 
catalyst complex that fits the "hairpin" model of 
catalytic RNA in accordance with the present invention. 

Figure 2 shows minimum energy folding of 
(-)sTRSV RNA. 

Figure 3 shows the time course of catalysis of 
a substrate RNA by the catalytic RNA. 

Figure 4 shows the Michaelis-Menten kinetics of 
the RNA catalytic reaction. 

Figure 5 shows the temperature dependence of 
the RNA catalytic reaction. 

Figure 6 shows the dependence of the rate of 
reaction on concentration of catalytic RNA. 

Figure 7 shows the reaction properties of a 
smaller RNA substrate. 

Figure 6 shows the loss of catalytic activity 
when the terminal A at position 175 or the terminal bases 
AU at positions 175 and 176 are removed from the 
catalytic RNA. 

Figure 9 shows loss of catalytic activity when 
bases 195-203 in the catalytic RNA sequence are removed. 

Figure 10 shows loss of catalytic activity when 
bases AAA at positions 203 , 202 and 201 are changed to 
CGU respectively. 

Figures HA-C show that there is no effect on 
catalytic activity when base A at position 49 in the 
substrate is changed to a G, U or C. 

Figure 12 shows that different target RNA 
sequences can be used as long as the base pairing with 
the catalytic RNA in the regions flanking the cleavage 
sequence is maintained. 
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Figure 13 9hows that an RNA sequence found in 
tobacco mosaic virus can be cleaved at a specific site 
with the catalytic RNA of the present invention.. 

Figures 14A-C show three substrates having 
sequences found in the sequence of the messenger RNA 
coding for chloramphenicol acetyl transferase. Figures 
14A-C also show the separation patterns on acrylamide 
gels of the reaction products obtained by reacting these 
substrates with catalytic RNAs designed to base pair with 
the substrates in the regions flanking the AGUC cleavage 
sequence. 

Figure 15 shows the sequence of a substrate 
having a sequence found in the sequence coding for the 
gag protein of the HIV-1 virus which causes AIDS. 
Figure 15 also shows the separation patterns on acryl- 
amide gels of the reaction products obtained by reacting 
this substrate with a catalytic RNA designed to base pair 
with the substrate in the regions flanking the CGUC 
cleavage sequence of the substrate. 

Figure 16 shows the sequence of a substrate 
having a sequence found in the sequence coding for the 
regulatory tat protein of the HIV- 1 virus. Figure 16 
also shows the separation patterns on an acrylamide gel 
of the reaction products obtained by reacting this sub- 
strate with a catalytic RNA designed to base pair with 
the substrate in the regions flanking the UGUC cleavage 
sequence of the substrate. 

Figure 17 shows the sequence of a substrate 
having four non-native U's added to the 3' end of the 
sequence of the native (-)sTRSV substrate shown in Fig- 
ure 1. Figure 17 also shows the separation patterns on 
an acrylamide gel of the reaction products obtained by 
reacting this substrate with different concentrations of 
a catalytic RNA designed to base pair with the substrate 
in the regions flanking the cleavage sequence of the 
substrate, including with the four non-native U's. 

- 8 - 
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Figure 18 summarizes the sequence requirements 
for the target region of the substrate RNA. Only GUC is 
required for cleavage. 

Figure 19 shows the positions of base changes 
(open boxes) made in the sequence of the catalytic RNA 
shown in Figure 1 in order to prove the existence of 
Helices 3 and 4 predicted by the "hairpin" model for 
(-)sTRSV RNA. Figure 19 also shows the separation pat- 
terns on acrylamide gels of the reaction products ob- 
tained by reacting the various catalytic RNAs with sub- 
strate RNA S17. 

Figure 20 shows the RNA sequence of an auto- 
catalytic cassette that has utility in terminating 
transcription at a very specific site. Figure 20 also 
shows the separation pattern on an acrylamide gel of the 
reaction products obtained when this catalyst was 
transcribed and cleaved autocatalytically . 

Figure 21 shows the positions of two base 
changes that were made in the native (-)sTRSV catalytic 
RNA sequence shown in Figure 1. Figure 21 also shows the 
separation patterns on acrylamide gels of the reaction 
products obtained by reacting these catalysts or catalyst 
R51 (control gel) with substrate S17. 

Figure 22A shows a plasmid map illustrating 
pertinent features of vector pHC-CAT which contains the 
CAT gene linked to the "hairpin" autocatalytic cassette 
of the invention. Also shown is the expected RNA tran- 
script of the illustrated region of pHC-CAT and the 
expected 5' fragment of the autocatalytic cleavage. 

Figure 22B shows the results of Northern blot 
analysis of RNA produced by host cells transformed with 
pHC-CAT. 

Figure 23 : Map of the pMHC-CAT mammalian 
expression vector. 

Figure 24 : Map of plasmid pHC19R. 
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Figure 25 : Shows the results of SI nuclease 
protection assay of RNA from transfected CHO cells and 
RNA controls. 

Figure 26 : Map of viral vector constructions 
(CMV) with the hairpin autocatalytic cassette in the 
sense orientation (pCS101HC7) and the antisense 
orientation (pCS101HC9) . 

Figure 2 7A : Shows the results of SI nuclease 
protection assay of RNA from infected turnip plants and 
RNA controls. 

Figure 27B ; Results of Northern blot analysis 
of the RNA isolated from control and infected plants. 

Figure 27C : Results of PCR amplification of 
viral sequences from control and infected turnip plants. 

Figures 27D and 27E : Results of Western 
immunoblot analysis of protein extracts of turnips leaves 
from control and infected plants. 

Figure 28 : HIV-1 target sequence. 

Figure 29 : Conservation of the HIV-1 target 
sequence in various HIV isolates. 

Figure 30 : Sequences of HIV-l substrate 
("SHIV") containing the conserved target sequence and of 
an engineered "hairpin" catalytic RNA ("RHIV") designed 
to cleave the substrate. 

Figures 31 and 32 : Results of tbe cleavage of 
SHIV substrate RNA by RHIV catalytic RNA. 

Figure 33 : Sequence of longer HIV-l transcript 
and results of its cleavage by RHIV. 

Figure 34 : Map of pHR and partial sequence. 

Figure 35 : Results of cleavage of substrate 
SHIV by RHIV and by catalytic RNA produced by T7 RNA 
polymerase transcription of pHR. 

Figure 36 : Map of pMSGRHIV and partial 

sequence. 

Figure 37 : Map of plasmid pMRHPT and partial 

sequence. 
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Figure 38 : Selection and testing scheme for 

RHGPT. 

Figure 39 ; Results of SI nuclease assay for 
reduction of HGPRT mRNA in CHO cells transf ected with 
pMRHPT and pMSG-dhfr. 

Figure 40 : Map of the plasmid pMCATRCAT and 
partial sequence. 

Figure 41 : Results of SI nuclease assay for 
reduction of CAT mRNA in CHO cells transfected with 
pMCATRCAT and pMSG-dhfr. 

Figures 42A-C : Summary of mutagenesis 
experiments with the (-)sTRSV RNA substrate- catalyst 
complex. 

Figure 42D : A more refined secondary structure 
model for the (-)sTRSV RNA substrate- catalyst complex. 

Figure 43 : Separation patterns on an 
acrylamide gel of the reaction products obtained by 
reacting substrate with "hairpin" catalytic RNA having 
the loop that closes the "hairpin" replaced by the 
sequence GGAC (UUCG) GUCC . 

DETAILED DESCRIPTION OF THE DRAWINGS 

Figure 1 , The "hairpin" model for RNA catal- 
ysis was developed by determining the minimum energy 
folding predicted by computer modeling of the catalytic 
complex containing the minimum size catalytic RNA and 
substrate RNA of (-)sTRSV RNA. It is this minimum energy 
folding which is shown in Figure i. Two molecules are 
shown folded: (1) catalytic RNA which contains 50 bases 
of satellite RNA (224-175) and (2) substrate RNA which 
contains 14 bases of satellite RNA (53-40) • The arrow 
represents the site of cleavage. 

The 50 -base catalytic RNA and the 14 -base 
substrate RNA are the "minimum size" in the sense that 
reductions in their length result in a substantial or 
total loss of catalytic activity as is shown in the 
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Examples below. Thus, this length of (-)sTRSV catalyst 
sequence is preferred to shorter lengths. Also, sub- 
strate RNA having at least the degree of base-pairing 
with the catalyst exhibited by the 14 -base substrate is 
preferred. 

Ficrure 2 . Minimum energy folding of 
(-)sTRSV RNA. The molecule was folded using the 
Wisconsin RNA folding program (Zucker, M. and Stiegler, 
P. (1981) Nucleic Acids Res. . 133-148; Devereux, J., 
Haeberli, P. and Smithies, 0. (1984) Nucleic Acids Res.. 
12: 387-395) with base numbers corresponding to ( + )sTRSV 
(Buzayan, J. M. , Gerlach, W. L. , Bruening, G. B. , Keese, 
P. and Gould, A. R. (1986) Virology , 151 : 186-199). With 
this numbering scheme the 5' -3' direction of the molecule 
is with decreasing base number. The minimum catalytic 
complex is identified. The substrate RNA sequence is 
between bases 53-40 and the catalytic RNA sequence is 
between bases 224-175. The arrow is the site of 
cleavage. 

The folding identifies regions of expected base 
pairing and expected non-base pairing, loops. This model 
does not preclude higher order interactions occurring 
between the loops. 

Figure 3 . Time course of substrate S17 cleav- 
age by catalytic RNA R51. The reaction was carried out 
under standard conditions, which were 37°C in 12mM MgCl 2 , 
40mM Tris pH 7.5 and 2iriM spermidine, for the following 
times: lane 1, 30 sec; lane 2, 5 min; lane 3, 15 min; 
lane 4, 30 min; lane 5, 60 min; lane 6, 90 min; lane 7, 
150 min. Concentrations were as follows: [R51] =0 . 0032uM 
and [S17] «0.09uM. RNA was separated on 7M urea, 20% 
acrylamide gels, bands cut. out and counted in the liquid 
scintillation counter for Figures 3-17 and 19-21. 
Throughout the figures, the designations 5'F and 3 f F are 
the products of cleavage of the substrate and represent 
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the resulting 5' fragments and 3' fragments, respec- 
tively. 

Figure 4 . Eadie Hofstee plot of catalytic RNA 
R51 cleavage of substrate RNA S17. The reaction was 
carried out at 37°C in 12mM MgCl 2 , 40mM Tris pH 7.5 and 
2mM spermidine. Concentrations were as follows: 

[R51]=0.0004uM and [S17] =0 . 125uM (lane 1), 0.0625uM (lane 
2), 0.0417uM (lane 3), 0.031uM (lane 4) , 0.021UM (lane 
5), 0.0156uM (lane 6), 0.0078uM (lane 7) and 0.0039uM 

(lane 8) . 

Figure 5 . Temperature dependence of the rate 
of cleavage of substrate RNA S17 by catalytic RNA R51. 
The reaction was carried out in 12mM MgCl 2 , 40mM Tris pH 
7.5 and 2mM spermidine at 45°C (lane 6), 41°C (lane 5), 
37°C (lane 4), 33°C (lane 3), 27°C (lane 2) and 20°C 
(lane 1). The concentrations used were: [R51]= 0.003uM 
and [S17] =0.09uM. R51 was unlabeled. The velocities 
shown in the graph in Figure 5 were calculated by the use 
of time points of 8 and 16 minutes. The separation 
patterns of the reaction products on the acrylamide gel 
shown in the figure are for the 16 -minute time point. 

Figure 6 . Rate of cleavage of substrate RNA 
S17 at varying concentrations of catalytic RNA R51. The 
reaction was carried out at 37°C in 12mM MgCl 2 , 40mM Tris 
pH 7.5 and 2mM spermidine for 40 min (lane 1 and 2), 20 
min (lane 3), 10 min (lane 4) and 5 min (lane 5). The 
concentration of substrate used was 0.175uM. 

Figure 7 . Eadie Hofstee plot of catalytic RNA 
R51 cleavage of substrate RNA S10. The substrate S10 
containing the RNA sequence: GACAGUCCUG was prepared 
from a DNA template containing the T-7 promoter as 
described in Example 2. This substrate was mixed with 
the catalytic RNA, R51, from Example 2 under standard 
conditions: 37°C in 12mM MgCl 2 , 40mM Tris pH 7.5 and 2mM 
spermidine for 10 min. Concentrations of substrate used 
were as follows: 0.115uM, 0.77uM, 0.038uM, 0.029UM, 
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0.0l4uM. The concentration of catalytic RNA, R51, used 
was lnM. The line was fit by linear regression analysis 
and intercept, kcat, and Michaelis constant, Km, 
calculated. 

Figure 8 . Removal of the terminal "A" at posi- 
tion 175 of the catalytic RNA. The "A" at base position 
175 (circled) was removed and the resulting catalytic 
RNA, R50, reacted with substrate S17. In addition, two 
bases were removed, to give R49 which had both A175 and 
U176 removed. Concentration of substrate S17 was 0.3uM 
and all catalytic RNA concentrations were 4nM. The 
reaction times were 20 min under standard conditions. 
Lane 1 R51/S17; Lane 2 R50/S17; Lane 3 R49/S17; Lane 4 
S17 alone. A 75% loss of activity was seen with R50 and 
R49. 

Figure 9 . Loss of activity when bases 195-203 
in the catalytic RNA sequence are removed. When the 
underlined bases were removed and the adjacent bases 
ligated together, no catalytic activity was seen. 

Figure 10 . Loss of activity when bases AAA at 
positions 203, 202 and 201 are changed to CGU respec- 
tively. When the circled AAA bases were replaced by the 
underlying 5'-CGU-3' bases, no catalytic activity was 
seen. 

Figure 11A . No effect on activity when base 
A49 in the substrate is changed to a n G" . The circled 
"A" base 49 in the substrate was changed to a "G n and no 
effect on activity was seen. The concentration of R51 
was 0.016uM, [S17]=*0.4uM, and [S17 (A->G) ] -0 . 2uM. 
Reaction under standard conditions was for* 40 min. Lane 
1 S17; Lane 2 S17/R51; Lane 3 S17(A->G); Lane 4 S17 
(A->G)/R51. 

Figure 11B . No effect on activity was seen 
when base A49 in the substrate was changed to a "U" 
(S17 (A-->U) ) . The concentration of substrate RNA S17 



14 - 



- 15 - 

(A-->U) used was 0.12uM and the concentration of R51 was 
0.0065uM. Reaction was for 60 minutes under standard 
conditions. The catalytic RNA was unlabeled* 

Figure 11C . No effect on activity was seen 
when base A49 in the substrate was changed to a "C" 
(S17 (A- * >C) ) . The concentration of substrate RNA S17 
(A-->C) used was 0.08uM and the concentration of R51 was 
0.0065uM. Reaction was for 60 minutes under standard 
conditions. The catalytic RNA was unlabeled. 

Figure 12 . Different target RNA sequences can 
be used as long as the base pairing is maintained with 
the catalytic RNA. The C:G base pair predicted by the 
"hairpin" model of the catalytic complex of (-)sTRSV, 
Figure 1, was changed to a G:C base pair (circled) and 
activity was maintained. In this experiment the usual 
substrate S17 was not used; instead a new substrate was 
used with the exact same sequence except tbe first two 
vector bases GC were eliminated. The resulting sequence 
of this new substrate S15 was gUGACAGUCCUGUUU. The sub- 
strate containing the C50->G50 base change was S15(C->G) 
and the catalytic RNA with the G214->C214 base change was 
R51(G->C). The reactions were run under standard condi- 
tions for 60 min at [R51] =0 . 07uM, [S15] «0 . 12uM, 
[S15 (0>G) ]=0.15uM, [R51 (G->C) ]=0.05uM. Lane 1 R51/S15; 
Lane 2 R51 (G->C) /S15 ; Lane 3 S15; Lane 4 R51 (G->C)/ 
S15(C->G); Lane 5 S15(C->G). 

Figure 13 . An RNA sequence found in tobacco 
mosaic virus (TMV) can be cleaved at a specific site. 
The substrate RNA shown is that beginning with nucleotide 
#53 8 in the tobacco mosaic virus sequence. The catalytic 
RNA was synthesized to base pair with the TMV substrate 
RNA in. the stem regions of the "hairpin" as shown by the 
circled base pairs. Reaction was for 20 min under 
standard conditions with a catalytic RNA concentration of 
3nM and a substrate concentration of 0.06uM. Lane 1 TMV 
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substrate RNA only; Lane 2 TMV catalytic RNA/TMV sub- 
strate RNA. 

Figures 14A-C . The sequences. of three sub- 
strate RNAs having sequences found in the messenger RNA 
for chloramphenicol acetyl transferase (CAT) are shown. 
They have 14-, 16- and 18-base long target sites, and the 
length of the 3' regions flanking the AGUC cleavage 
sequence is extended in substrates (B) and (C) as com- 
pared to substrate (A) . Catalytic RNAs designed to base 
pair with the substrate RNAs in both the 3' and 5' 
regions flanking the cleavage sequence AGUC were synthe- 
sized. The open boxed bases are those which are differ- 
ent from those in the native (-)sTRSV substrate RNA 
sequence shown in Fig. 1. 

Figure 15 . This figure shows the sequence of a 
substrate RNA having a sequence found in the region of 
the HIV-l viral RNA which specifies the gag protein. A 
catalytic RNA was made whose sequence was designed so 
that the catalyst would base pair with the substrate RNA 
in both the 3' and 5' regions flanking the cleavage 
sequence. The open boxed bases are those which are 
different than those of the native (-)sTRSV sequence 
shown in Figure 1. The catalytic RNA cleaved the sub- 
strate RNA at the arrow. 

Figure 16 . Shown is the sequence of a sub- 
strate RNA having the sequence found at the beginning of 
the coding region for the regulatory protein tat of the 
HIV-l virus. A catalytic RNA was made which was designed 
so that it would base pair with the substrate RNA in both 
the 3' and 5' regions flanking the UGUC catalytic 
cleavage sequence. The open boxes are bases which are 
different from those of the native (-)sTRSV substrate 
sequence shown in Figure 1. Cleavage occurred at the 
arrow as shown. 

Figure 17 . Shown is the sequence of a sub- 
strate RNA having four non-native bases (UUUU) added to 
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the 3' end of the sequence of the native (-)sTRSV sub- 
strate RNA shown in Fig. 1. A corresponding catalytic 
RNA was made whose sequence was designed so it would base 
pair with the substrate in both the 3' and 5' regions 
flanking the AGUC cleavage sequence. Cleavage rates with 
a constant catalytic RNA concentration and various con* 
centrations of substrate RNA were determined by cutting 
out the bands of the acrylamide gels, counting the 
radioactivity and plotting the data using Michaelis- 
Menton procedures to calculate Km and kcat. 

Figure 18 . Summary of the sequence require- 
ments for the target region of substrate RNA. Only a GUC 
sequence is required for cleavage of the substrate as 
long as the short sequences of bases in the regions of 
the substrate flanking the cleavage sequence are sub- 
stantially base paired with corresponding regions of the 
RNA catalyst. The regions of base pairing are labeled 
Helix 1 and Helix 2 in the figure. Also, the regions of 
base pairing predicted by the "hairpin" model for 
(-)sTRSV to exist in the "hairpin" portion of the 
catalyst are labeled Helices 3 and 4. 

Figure 19 . Confirmation of the existence of 
Helices 3 and 4 predicted by the "hairpin" model for 
(-)sTRSV RNA. A G->C base mutation in base 35 (count 
bases from the 5' end of the catalytic RNA sequence 
shown) of the (-)sTRSV catalytic RNA sequence shown in 
Fig. l resulted in an RNA with no catalytic activity 
(Lanes 3 and 4 ("mismatch")). A double mutant, G35->C; 
C27->G had restored catalytic activity (Lanes 5 and 6 
("substitute b.p.")). These two base changes are in the 
Helix 4 region whose existence is predicted by the 
"hairpin" model for (-)sTRSV. Also, a catalytic RNA 
having a single base change at position 47 (G47->C) was 
inactive (Lanes 9 and 10), while a double mutant, with a 
second mutation C17->G, had restored activity (Lanes 11 
and 12) . These two base changes are in the Helix 3 
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region whose existence is predicted by the "hairpin" 
model. The control (Lanes 1, 2, 7 and 8) is cleavage of 
the substrate RNA S17 having the native (-)sTRSV sequence 
by catalytic RNA sequence R51 having the native sequence. 

Figure 20 . The RNA sequence of a synthetic 
"hairpin" autocatalytic cassette is shown. The sequence 
shown in Figure 20 is the same as that of the catalyst 
shown in Figure 1, but with additional 5' bases added to 
form a loop at the 5' end of the catalyst and to provide 
a substrate target sequence ( i.e. . a cleavage sequence 
and upstream and downstream flanking bases) which can 
bind to the substrate binding portion of the catalyst 
sequence. Such an RNA was prepared. When transcription 
was performed, the cassette autocatalytically cleaved at 
the expected site to give the appropriate 3'F and 5'F 
products. 

Figure 21 . Shown are two base changes that 
were made in the native (-)sTRSV catalytic sequence shown 
in Figure l. The two bases changes were an "A" to "U" 
mutation at position 217 and a "G" to "C" mutation at 
position 216. Figure 21 also shows the separation 
patterns on acrylamide gels of the reaction products 
obtained by reacting one of these catalysts or R51 
(control) with substrate S17. Both base changes produced 
catalysts that were inactive when the catalysts were 
reacted with substrate S17 under standard conditions for 
15 minutes. 

Figure 22A shows a plasmid map illustrating 
pertinent portions of vector pHC-CAT containing the CAT 
gene linked to the "hairpin" autocatalytic cassette of 
the invention so that the "hairpin" autocatalytic RNA 
would be expected to serve as a chain terminator for the 
CAT gene. Also shown is the expected RNA transcript of 
the illustrated region ( "CAT- cassette RNA") and the 
expected 5' fragment of the autocatalytic cleavage 
("Cleaved CAT-cassette RNA"). Finally, the figure illus- 
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trates the location of DNA probes designed to hybridize 
with different regions of the CAT- cassette RNA transcript 
and Cleaved CAT- cassette 5' fragment. 

Figure 22B shows the results of Northern blot 
analysis of RNA isolated from Escherichia coli host cells 
transformed with pHC-CAT. When the CAT probe was used, 
both the full length transcript and the expected 5' 
cleavage fragment were detected, indicating that cleavage 
took place in vivo . When the "hairpin" autocatalytic 
cassette probe was used, only the full length CAT- 
cassette RNA transcript was detected. The fact that the 
5' fragment did not hybridize with this probe was to be 
expected, since most of the "hairpin" autocatalytic 
cassette transcript would be in the 3' fragment after 
cleavage. Although, it would be expected that the 
"hairpin" autocatalytic cassette probe would hybridize to 
the 3'' fragment, the fact that the 3' fragment was not 
detected by Northern blot analysis is not surprising. 
The 5' terminus of the 3' fragment would contain a 5' -OH 
an'd not the 5'-ppp which is ordinarily seen in RNA 
transcripts. Thus, the 3' fragment would be expected to 
be very labile in vivo and was likely degraded immedi- 
ately after the autocatalytic cleavage. 

Figure 23 . Shown is the map of the vector 
pMHC-CAT which was constructed by excising the "hairpin" 
autocatalytic cassette from pHC (prepared as described in 
Example 23) with Smal/Sall and ligating it to vector pMSG 
at the Smal/Xhol sites to give the vector pMHC. Then, 
the CAT gene was excised from pMAM-NEO-CAT with Smal/Xhol 
and was ligated into pMHC at the Smal/Xhol sites to give 
pMHC-CAT as shown. 

Figure 24 . Shown is the map of the vector 
pHC19R which was prepared by excising the "hairpin" 
autocatalytic cassette from pHC with BamHI/Sall and 
ligating it to the Sall/BamHI sites of pT219R to give 
pHC19R as shown. 
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Figure 25 . Shown are the results of an SI 
nuclease protection assay performed on RNA isolated from 
CHO cells that had been transfected with pMHC-CAT. In 

the gel shown in Figure 25, the lanes contain: Lane 1 

probe (148nt) ; Lane 2 SI nuclease digested probe; Lane 

3 — in vitro transcribed "hairpin" autocatalytic cassette 
that had been hybridized to probe and SI nuclease 

digested (uncleaved 134nt, 3'F 87nt, 5'F 47nt) ; Lane 4 

RNA isolated from uninduced, pMHC-CAT- transfected CHO 
cells, hybridized to probe and SI nuclease digested; and 

Lane 5 RNA isolated from dexamethasone induced, pMHC- 

CAT- transfected CHO cells, hybridized to probe and SI 
nuclease digested (uncleaved Hint, 3'F 69nt, 5'F 42nt). 
All mobilities were as expected. 

Figure 26 . Shown are catalytic RNA/viral 
vector constructions used to infect plants. In these 
constructions, the "hairpin" autocatalytic cassette from 
the vector pHC was ligated to the cauliflower mosaic 
virus (CMV) in vector pCSlOl to give the two engineered 
CMV viral constructs shown. The "hairpin" autocatalytic 
cassette is in the sense orientation in pCS101HC7 and in 
the antisense orientation in pCS101HC9. 

Figure 27A : Shown are the results of an SI 
nuclease protection assay performed on RNA isolated from 
turnip plants that had been infected with pCS101HC7 or 
pCS101HC9. Lane 1 is undigested "hairpin" autocatalytic 
RNA probe (148 nt) and Lane 2 is RNA transcribed from 
Hindlll-digested plasmid pHC which gave three RNA 
products uncleaved "hairpin" autocatalytic cassette 
(158nt) , 3'F (87nt) and 5'F (71nt) . The remaining lanes 
are all SI nuclease digests of probe hybridized to the 
following RNA preparations: Lane 3 RNA from uninfected 
plants; Lane 4 RNA from plants infected by the virus 
control (pCSlOl) ; Lane 5 RNA from plants infected with 
PCS101HC7 (uncleaved 112nt, 3'F 70nt and 5'F 42nt) ; Lane 
6 RNA from plants infected with pCS101HC9. 
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Figure 27B : Results of Northern blot analysis 
of the RNA isolated from plants that were mock infected, 
infected with pCSlOl (wild- type CMV) or infected with 
pCS101HC7. The probe was labelled "hairpin" autocata- 
lytic RNA. Lane 1, RNA from mock- infected plants; Lane 
2, RNA from plants infected with pCSlOl (wild-type CMV) ; 
and Lane 3, RNA from plants infected with pCS10lHC7. 

Figure 27C ; Total DNA from plants that were 
mock infected, infected with pCSlOl (wild- type CMV) or 
infected with pCS101HC7 was amplified by the polymerase 
chain reaction (PCR) . Lane M, molecular weight markers; 
Lane 1, PCR-amplif ied DNA from mock- infected plants; Lane 
2, PCR-amplif ied DNA from plants infected with pCSlOl 
(wild-type CMV) ; Lane 3, PCR-amplif ied DNA from plants 
infected with pCSl01HC7; Lane 4, PCR-amplif ied DNA from 
the pCSlOl plasmid; and Lane 5 PCR-amplif ied DNA from 
PCS101HC7 plasmid. 

Figures 27D and 27E : Results of Western 
immunoblot analysis of protein extracts of turnips leaves 
from plants that were mock infected, infected with pCSlOl 
(wild-type CMV) or infected with pCSl01HC7. Figure 27D 
shows the results for plants one month after inoculation, 
and Figure 27E shows the results for plants two months 
after inoculation. The lanes are the same in both 
figures. Lane 1, protein from mock- infected plants; Lane 
2, protein from plants infected with pCSlOl (wild- type 
CMV); and Lane 3, protein from plants infected with 
pCS101HC7. CP= coat protein, (M) = molecular weight 
markers . 

Figure 28 : HIV-1 target sequence. The 16 -base 
target sequence is found in the 5 '-leader region of all 9 
HIV-1 mRNAs. The nucleotide numbering starts at the 
first base transcribed in the HXB2 clone (HIV Sequence 
Data Base, prepared and distributed by Gerald Myers et 
al., Los Alamos National Laboratory, Los Alamos, NM, 
telephone (505) 665-0480) . 
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Figure 29 : 



Conservation of the HIV-1 target 



sequence in various HIV isolates. All sequences are from 
the HIV Sequence Data Base and are listed 5'--> 3'. All 
homologies are +107 to capsite (+1) , except MAL which is 
+105. The capsite sequence searched for was GGT CTC TCT. 
Only two isolates (MAL and MN) had variations in the 
target sequence, and the variations were, in each case, a 
single base change (G-->A). In the figure, eg indicates 
that the complete genome was contained in the sequence 

file, and n n indicates proviral DNA without any 

homology which contained sequence information that 
started too late or ended too soon. It is likely that a 
homologous sequence does indeed occur in these strains. 

Figure 30 : Sequence of HIV-1 substrate 
("SHIV") having the target sequence of Figure 28 plus 
additional GCG vector bases at its 5' end. Also shown is 
the sequence of engineered "hairpin" catalytic RNA 
("RHIV") designed to cleave this substrate. The 
catalytic RNA also has additional 5' vector bases 
3 ' - CUGAGGG- 5 ' as shown . 

Figure 31 : Time course of cleavage of the 
substrate RNA SHIV by RHIV (both depicted in Figure 30) . 
Shown are the catalytic RNA RHIV (R) , substrate RNA SHIV 
(S), 3' cleavage fragment <3'F), and 5' cleavage fragment 
(5'F). Since 35% of the substrate was uncleavable, the 
remaining 65% was normalized to 100% on the ordinate of 
the graph. 



substrate RNA SHIV by RHIV (both depicted in Figure 30) . 
Shown are the catalytic RNA RHIV (R) , substrate RNA SHIV 
(S), 3' cleavage fragment (3'F), and 5' cleavage fragment 
(5'F). The incubation time was 5 minutes, and the 
concentration of RHIV was 0.005 uM. The concentration of 
SHIV was: Lane 1--0.10 uM; Lane 2- -0.05 uM; Lane 3-- 
0.025 uM; Lane 4- -0.012 uM; Lane 5- -0.006 uM; and Lane 6- 
-0.025 uM (this is control lane at zero time). From the 



Figure 32 : Kinetics of the cleavage of the 
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graph, the Km was found to be 100 nM, and che kcat to be 
1.6/min. From the time course shown in Figure 31 f it was 
determined that 35% of the substrate was uncleavable. 
This was subtracted from these calculations. 

Figure 33 : Sequence of longer HIV-1 transcript 
and results of its cleavage by RHIV. Shown on the gel 
are the uncleaved transcript of 183nt and the two 
cleavage products (5'F of Hint and 3'F of 72nt) . The 
control lane at 0 minutes showed no cleavage. The gel 
was calibrated with standards, and all mobilities were as 
expected. 

Figure 34 : Map of pHR and partial sequence. 

Ficrure 35 : Results of cleavage of the 
substrate SHIV by the lOlnt catalytic RNA (designated as 
"PRHIV") produced by T7 RNA polymerase transcription of 
pHR. Shown on the gel are RHIV (R) , PRHIV (PR), SHIV 
(S) , 3' cleavage fragment (3'F), and 5' cleavage fragment 
(5'F). Times of incubation were 0 and 15 minutes. 

Figure 36 : Map of pMSGRHIV and partial 

sequence. 

Figure 37 : Map of plasmid pMRHPT. This 
mammalian expression vector contains DNA encoding an 
engineered "hairpin" catalytic RNA ("RHGPT") under 
control of the dexamethasone- inducible MMTV promoter and 
terminated by the "hairpin" autocatalytic cassette. 

Figure 38 : Selection and testing scheme for 

RHGPT. 

Figure 39 : Results of SI nuclease assay for 
reduction of HGPRT mRNA in CHO cells transfected with 
pMRHPT and pMSG-dhfr. Lane 1 contains the P 32 - labelled 
14 8nt probe which hybridizes to HGPRT mRNA; Lane 2 
contains P 32 -labelled "hairpin" autocatalytic RNA used as 
a standard (seen only on longer exposures); Lanes 3 and 4 
contain SI -nuclease digested RNA isolated from cells 
transformed with pMRHPT and pMSG-dhfr and induced (Lane 
4) or not induced (Lane 3) with dexamethasone. The arrow 
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shows the location of the probe-protected RNA which 
corresponds to HGPRT mRNA. 

Figure 40 ; Map of the plasmid pMCATRCAT. This 
plasmid contains DNA encoding a "hairpin" RNA catalyst 
engineered to cleave CAT mRNA opera tively linked to the 
"hairpin" autocatalytic cassette, all driven by the 
dexamethasone- inducible MMTV promoter. The CAT gene is 
on the same plasmid and is driven by the SV40 promoter. 

Figure 41 : Results of Si nuclease assay for 
reduction of CAT mRNA in CHO cells transfected with 
pMCATRCAT and pMSG-dhfr. Lane 1 contains the P^-labelled 
119nt probe which hybridizes to CAT mRNA; Dane 2 contains 
RNA from untransf ected CHO cells; Lane 3 contains mRNA 
transcribed from the CAT gene on vector pHC-CAT in vitro : 
Lanes 4 and 5 contain Sl-nuclease digested RNA isolated 
from cells transfected with pMCATRCAT and pMSG-dhfr and 
induced (Lane 5) or not induced (Lane 4) with 
dexamethasone . 

Figures 42A-C : Summary of mutagenesis experi- 
ments with the (-)sTRSV "hairpin" substrate -catalyst 
complex. Each base or combination of bases enclosed in a 
circle represents a separate mutational experiment* 

Figure 42D : A more refined secondary structure 
model for the (-)sTRSV "hairpin" substrate- catalyst 
complex. 

Figure 43 ; Separation patterns on an 
acrylamide gel of the reaction products obtained by 
reacting substrate with a "hairpin" catalytic RNA having 
the loop that closes the "hairpin" replaced by the hair- 
pin sequence GGAC (UUCG) GUCC . 

DESCRIPTION OF THE PRESENTLY PREFERRED EMBODIMENTS 

An RNA catalyst has been identified comprising 
an RNA sequence which can be engineered to cut, with 
great precision, target RNAs having a cleavage sequence. 
In particular, the invention comprises certain synthetic 
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RNA catalysts capable of cleaving an RNA substrate which 
contains the target sequence 5' -Fj-CS-F 2 -3 ' . 

"Synthetic RNA catalyst," as used herein, means 
a catalyst which is not a naturally- occurring RNA 
catalyst, although "synthetic catalysts" may be truncated 
or altered versions of naturally- occurring catalysts. 
"Synthetic catalysts" are catalysts designed according to 
the principles set forth herein based on the "hairpin" 
model to bind to and cleave a selected target sequence in 
a selected RNA substrate. "Synthetic catalysts" are also 
sometimes referred to herein as "engineered catalysts." 
"Synthetic catalysts" include catalysts synthesized in 
vitro and catalysts, synthesized in vivo . In particular, 
"synthetic catalysts" include catalysts produced by hosts 
transformed by a vector comprising a sequence coding for 
the catalyst. 

RNA of any length and type may be used as the 
substrate as long as it contains the 5' -Ft-CS-Fj-3' target 
sequence. In this formula, CS is the cleavage sequence 
( i.e. . a sequence of bases containing the site at which 
the catalyst cleaves the substrate) . CS is a short se- 
quence of bases which does not base pair with the RNA 
catalyst, and CS preferably has the sequence 5'-NGUC-3', 
wherein N is any base, and the substrate is cleaved by 
the catalyst between N and G to produce a fragment having 
an OH at the 5' end and a fragment having a 2, '3' cyclic 
phosphate at the 3* end. 

CS is flanked by two short base sequences F x and 
F 2 which do base pair with the RNA catalyst F x is 
preferably at least 3 bases in length, most preferably 4 
bases in length, F 2 is also preferably at least 3 bases 
in length, most preferably 6 to 12 bases in length. 

Catalysts according to the invention comprise a 
substrate binding portion and a "hairpin" portion. The 
substrate binding portion of the catalyst lias the 
sequence: . 



- 25 - 



- 26 - 



3'F 4 -L 1 -F 3 -5' 

wherein, 

F 3 is a sequence of bases selected so that F 3 is 
substantially base paired with F 2 when the catalyst is 
bound to the substrate; 

F 4 is a sequence of bases selected so that F 4 is 
substantially base paired with Fj when the catalyst is 
bound to the substrate; 

the sequences of F 3 and F 4 being selected so 
that each contains an adequate number of bases to achieve 
sufficient binding of the RNA substrate to the RNA cata- 
lyst so that cleavage of the substrate can take place; 
and 

Lj is a sequence of bases selected so that h x 
does not base pair with CS when the catalyst is bound to 
the substrate. 

As used herein, "substantially base paired" 
means that greater than 65% of the bases of the two RNA 
sequences in question are base paired, and preferably 
greater than 75% of the bases are base paired* "Sub- 
stantially unpaired" means that greater than 65% of the 
bases of the two sequences in question are not base 
paired, and preferably greater than 75% of the bases are 
not paired. 

F 3 is preferably at least 3 bases in length, 
most preferably from 6 to 12 bases in length. F 4 is 
preferably from 3 to 5 bases in length, most preferably 4 
bases in length. 

Lj is a short sequence of bases which preferably 
has the sequence 5'-AGAA-3' when CS has the sequence 5'- 
NGUC-3'. Further, when l» x is 5'-AGAA-3' and CS is 
5'-NGUC-3', then the first base pair between F x and F 4 
adjacent to CS and L, is preferably G:C or C:G (see Fig- 
ure 42D and Example 32) . Accordingly, a preferred target 
sequence in a selected substrate contains the sequence 
5'-SNGUC-3', wherein S is G or C. 
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The "hairpin" portion is a portion of the 
catalyst which folds into a hairpin- like configuration 
when the substrate- catalyst complex is modeled in two 
dimensions for minimum energy folding. This is shown in 
Figures 1 and 42D for (-)sTRSV RNA. The "hairpin" 
portion is not an absolute hairpin in the sense that not 
all bases of the "hairpin" portion are base -paired. 
Indeed, it is preferable, perhaps necessary, for the 
"hairpin" portion to have at least one substantially 
unpaired region so that the catalyst can assume a ter- 
tiary structure that allows for better, or optimal, 
catalytic activity. 

The "hairpin" portion of the catalyst prefer- 
ably has the sequence: 

wherein, 

Pj and P 4 each is a sequence of bases, the 
sequences of Pj and P 4 being selected so that P x and P 4 are 
substantially base paired; 

Z> x is covalently attached to F 4 ; 

S x and S 2 each is a sequence of bases, the 
sequences of S x and S 2 being selected so that S x and S 2 are 
substantially unpaired; 

P 2 and P 3 each is a sequence of bases, the 
sequences of P 2 and P 3 being selected so that P 2 and P 3 are 
substantially base paired; and 

is a sequence of unpaired bases. 

"Substantially base paired" and "substantially 
unpaired" have the same meanings as discussed above. 

Pj and P 4 each is preferably from 3 to 6 bases 
in length, and most preferably V x has the sequence 
5'-ACCAG-3' and P 4 has the sequence 5 ' -CUGGUA-3 9 . It has 
been found that the A at the 5' end of 5'-&CCAG-3' 
(underlined) is not base paired to the U at the 3' end of 
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5'-CUGGUA-3' (underlined), and the unpaired A may act as 
a "hinge" (see Example 32) . It is not known yet, but the 
presence of such a "hinge" may be necessary for catalytic 
activity. 

Sj and S 2 each preferably is from 4 to 9 bases 
in length, and most preferably S, has the sequence 
5' -AGAAACA-3' and S 2 has the sequence 5' -GUAUAUUAC-3' . 

P 2 and P 3 each is preferably from 3 to 9 bases 
in length, and more preferably P 2 has the sequence 
5'-CAC-3' and P 3 has the sequence 5'-GUG-3'. 

Finally, 1^ is preferably at least 3 bases in 
length and preferably has the sequence 5'-GUU-3'. 
Further, 5 ' -Sj-Pj-Lj-S' preferably has the sequence 
5 ' - AG AAACACACGUU - 3 ' • 

The specific preferred sequences set forth 
above for P,, P 2 , S lf etc, are from the catalytic sequence 
of (-)sTRSV RNA. 

A preferred catalyst according to the invention 
contains the sequence: 

5' - F 3 - Lj - F 4 - ACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA - 3 ' , 

and active variants thereof, wherein F 3 , F 4 and L x are as 
defined above. As used herein "active variants" means 
catalysts which, although having substitutions, deletions 
and/or additions of bases as compared to tlie original 
sequence, are still capable of cleaving an RNA substrate. 

The most preferred sequence for 5 ' -P 2 -Ij2-P 3 -3 ' is 
5' -CACGGACUUCGGUCCGUG-3' [SEQ ID 46] (see Example 32). 
Accordingly, the most preferred catalyst has the 
sequence: 

5'-F 3 -L r F 4 -ACCAGAGAAACACACGGACUUCGGUCCGUG- 
GUAUAUUACCUGGUA- 3 ' [SEQ ID 4 7] 
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wherein F 3 , F 4 and L x are as defined above and the 
underlined portion is the preferred sequence for P 2 -L2-P 3 . 

Another preferred catalyst according to the 
invention is an autocatalytic catalyst containing the 
sequence: 



P 3 ---S 2 —-P 4 F r --CS---F 2 



\p 2 ... Sr ..p r .. Pr .. Lr ..p. 




wherein, F,, F 2 , F 3 , F 4 , L,, Lj, S 1# S 2 , P lf P 2 , P 3 and P 4 are 
as defined above, Lj is a sequence of unpaired bases, and 
L 3 preferably has the sequence 3'-CCUCC-5'. Thus, the 
molecule contains a substrate portion (5' -Fj-CS-Fj-3 ' ) and 
a catalytic portion (5' -F 3 -L J -F 4 -P,-S 1 -P2-L2-P3-S 2 -P 4 -3 f ) 
covalently linked together by L 3 so as to produce a 
synthetic autocatalytic RNA catalyst. 

After being transcribed, this catalyst will 
spontaneously undergo an intramolecular autocatalytic 
cleavage at CS. The effect of this intramolecular 
cleavage ~Ts to autocatalytically terminate any RNA 
transcript in which the sequence is inserted. For 
instance, an engineered DNA molecule may be prepared 
which comprises a gene of interest and a DNA sequence 
coding for the catalyst linked so that, when transcribed, 
the catalyst will terminate transcription of the 
messenger RNA coding for the gene product at a desired 
location. 

The invention further provides an engineered 
DNA molecule and a vector comprising a DNA sequence 
coding for an RNA catalyst of the invention. Also pro- 
vided are host cells which have been transformed with the 
vectors and which are capable of expressing the RNA 
catalyst. Finally, the invention provides a method of 
cleaving an RNA substrate which contains tlie sequence 
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5' -F r CS-F 2 -3 ' , the method comprising contacting the 
substrate with a synthetic RNA catalyst according to the 
invention. 

The invention is further described below with 
particular reference to the catalytic properties of 
(-)sTRSV RNA and the structure of its catalytic complex, 
but the invention is not limited to synthetic catalysts 
based on (-)sTRSV RNA. In addition to the particular 
catalytic sequences shown and described below, other RNA 
molecules having catalytic activity to cleave an RNA 
substrate can be readily found by applying the principles 
set forth in this specification. 

For example, RNA sequences having the required 
structural features for cleaving an RNA substrate can be 
identified by applying the Wisconsin RNA Folding Program 
discussed above to (1) known sequences of molecules 
having catalytic or autocatalytic activity (especially 
molecules in which the actual location of the catalytic 
site is unknown) , (2) randomly generated sequences having 
the proper pairing regions and lengths, and (3) randomly 
modified known catalytic sequences, while looking for 
-known features of a catalytic molecul e , such as the 
"hairpin" configuration of the catalytic complex when 
modeled in two dimensions. As a specific example, infor- 
mation regarding known autocatalytic cleavage sites can 
be used to find substrate binding sequences having the 
properties described above, such as having a substrate 
binding sequence adjacent a "hairpin" portion. Secondary 
features, such as the two substantially paired regions in 
the "hairpin" portion, with an intermediate substantially 
unpaired region and an appropriate base loop can then be 
looked for, either by manual examination or with auto- 
mated computer programs ? In order of decreasing prefer- 
ence, the following features are considered important in 
selecting a catalytic sequence: (1) regions that can 
base pair (form helices) with the regions of the sub- 
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strate RNA molecule flanking the cleavage sequence; 
(2) an unpaired (loop) region opposite the cleavage 
sequence of the substrate RNA; (3) two substantially base 
paired regions in a "hairpin" structure near the cleavage 
sequence; (4) a substantially unpaired region between 
these two substantially paired regions in the "hairpin" 
structure; and (5) a loop connecting the two strands of a 
substantially base paired region to complete the 
"hairpin" structure. Standard techniques of in vitro RNA 
synthesis can then be used to prepare actual molecules 
having the sequence that gives the predicted two- 
dimensional computer -generated structure for verification 
of activity and routine testing of variation to determine 
optimum sequence. 

The catalysts of the present invention were 
developed using a "hairpin" model or motif of RNA 
catalysis. According to this model, the catalytic 
complex, when modeled in two dimensions for minimum 
energy folding, assumes a "hairpin" configuration. This 
is shown in Figures 1 and 42D for (-)sTRSV RNA. The 
catalytic complex is a complex of the minimum, or 
substantially the minimum, sequence of the catalyst 
necessary for activity and the minimum, or substantially 
the minimum, target sequence of the substrate. The 
"hairpin" configuration is not an absolute hairpin in the 
sense that not all the bases that make up this "hairpin" 
configuration are base -paired. Indeed, there are 
preferably regions of unpaired bases and of substantially 
unpaired bases as discussed in detail elsewhere in the 
present application. 

The "hairpin" model has proved very useful in 
designing new catalysts, but it is still only a computer 
model of the likely secondary structure of catalytic 
complexes involving catalysts according to the present 
invention. Also, it is ultimately the tertiary structure 
of RNA catalysts that determines their activity. For 
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these reasons, all catalysts having the properties 
described herein are considered to come within the scope 
of the present invention, even if they do not form a 
"hairpin" configuration when complexed with the substrate 
and even if they do not contain a "hairpin" portion. For 
instance, it may be possible to engineer a catalyst 
having the properties described herein which does not 
have a loop 1^. Such a catalyst would be considered to be 
fully equivalent to the catalysts described and claimed 
herein. 

As described in Example 1, a catalytic complex 
was identified within the (-) strand of tobacco ringspot 
virus satellite RNA (sTRSV) when the molecule was folded 
using computer models to determine the minimum energy 
folding in two-dimensional space. The (-) strand is a 
359 base long RNA of defined sequence and is known to 
have autocatalytic properties (Gerlach, W. L. , Buzayan, 
J. W. , Schneider, I. R. and Bruening, G. B. (1986) 
Virology . 151 : 172-185; Buzayan, J. M. , Gerlach, W. and 
Bruening, G. (1986) Nature , 323 ; 349-352). The (-) 
strand cleaves at a defined site (ApG) into a cleavage 
product having an OH at the 5' end and a 2 ',3' cyclic 
phosphate at the 3' end. Up until the present time, 
however, little work had been done with the (-) strand to 
find the catalytic complex and to determine the minimum 
cleavage sequences because it does not fit the consensus 
"hammerhead" model. 

In view of the above and the fact that the 
catalytic center would contain both the catalytic RNA 
sequence and the substrate (target) RNA sequence and by 
studying the results of Example 1, a 50 nucleotide se- 
quence between bases 175 and 224 was picked and a 14 
nucleotide sequence between bases 40 and 53 was picked. 
Using methodologies found in published procedures, a 
catalytic RNA having a satellite RNA base sequence 
identical to the base 1 sequence in naturally- occurring 
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reaction rate also increased with increasing concentra- 
tion of divalent cations as shown in Example 8. 

The reaction takes place at physiological 
temperatures, preferably 16°C to 45°C, with a temperature 
optimum at 37°C as described in Example 6, Temperatures 
above about 45 °C inactivate the reaction. However, the 
temperature optimum of the reaction is affected by the 
degree of base pairing between the substrate and catalyst 
(see Example 18) . In particular, the length of the 
region of the catalyst that base pairs with the 3' region 
of the substrate flanking the cleavage sequence can be 
varied so that an engineered catalyst reacting at a 
desired temperature can be obtained (see Example 18) . 
Further, a "hairpin" catalyst which is more thermal 
stable than the native (-)sTRSV catalyst can be prepared 
by deleting the loop that closes the "hairpin" (Loop III 
in Figure 42D) and inserting therefor the stable hairpin 
sequence 5-GGAC (UUCG)GUCC-3 ' [SEQ ID 45] (see 
Example 32) . 

The 50 base catalytic RNA configured in the 
"hairpin" model in Figures 1 and 42D is the minimal 
sequence, or substantially the minimal sequence, of 
(-)sTRSV RNA necessary to achieve catalysis. When the 3' 
terminal A (base 175) or AU (bases 175 and 176) of this 
sequence was removed, catalytic activity was substan- 
tially decreased as shown in Example 10. The 5' end of 
the molecule cannot be substantially changed either 
without affecting catalytic activity because it is needed 
to provide base pairing with the substrate. It can be 
shortened by at most about 3 bases (see Example 9) . 
Experiments which removed bases 195-203 in the center of 
the catalytic RNA and ligated base 194 to base 204 
produced an inactive catalytic RNA as described in 
Example ll. This shows that all or part of the region 
between bases 195-203 is essential for catalytic 
activity. An additional mutagenesis experiment to test 
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the base requirement of the three A bases at positions 
203, 202 and 201 was done. When these bases were changed 
to CGU, respectively, as described in Example 12, the 
resulting catalytic RNA was inactive. 

Base changes can be made in the two base paired 
regions (Helices 3 and 4 in Figure 42D) of the "hairpin" 
portion of the (-)sTRSV catalytic RNA, as long as sub- 
stantial base pairing is maintained. This is shown in 
Examples 22 and 32 where base changes that destroyed base 
pairing in these two regions resulted in inactive 
catalysts. When a second base change was made which 
restored base pairing, the catalytic activity was also 
restored (see Figures 19 and 42A-C) . The one exception 
appears to be the first C:G base pair at the end of Helix 
3 nearest the substrate binding portion (base pair 
C16:G48 in Figure 42D) . Currently available evidence 
indicates that the identity of these two bases must be 
maintained (see Example 32) . 

An active ribozyme is produced when Helix 4 is 
extended and the sequence of the loop that closes the 
"hairpin" (Loop III in Figure 42D) is changed. As shown 
in Figure 42A, Loop III was replaced with the common and 
very stable RNA hairpin sequence 5' -GGAC(UUCG)GUCC-3 ' . 
As a result of this substitution, Helix 4 was extended by 
four base pairs and the GUU sequence of Loop III was 
replaced with the sequence UUCG (see Figure 42A) . The 
resulting RNA catalyst was more active and, as noted 
above, more thermally stable than the unmutated form (see 
Example 32) . It was concluded from this experiment that 
Loop III does not have a conserved or unique base 
sequence and that Helix 4 can be extended by at least 
four base pairs without loss of activity. 

However, the simple replacement of the GUU 
sequence of Loop III with the sequence UUCG gives an 
inactive ribozyme (see Example 32) . It is believed that 
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this shows that the sequence of Loop III has an influence 
on the stability of Helix 4. 

When Loop III is cut between U31 and U32, 
activity is lost (see Figure 42C) . A likely explanation 
for this is that when Loop III is cut. Helix 4 opens up 
and catalytic activity is, consequently, lost. 

Base changes can also be made in the two re- 
gions of the catalytic RNA that base pair with the sub- 
strate, as long as substantial base pairing with the 
substrate in the regions flanking the cleavage sequence 
is maintained and base pairing with the cleavage sequence 
is avoided as shown in Examples 9, 16-21 and 32. Indeed, 
every base pair in Helices 1 and 2 (see Figure 42D) can 
be changed to any other base pair, except the base pair 
in Helix 2 adjacent to the NGUC cleavage sequence (see 
Example 32) . This base pair must be G:C or C:G and 
cannot be A:U or U:A, except it is believed that this 
base pair is needed for stability and that A:U or U:A 
base pairs can be used if other measures are taken to 
stabilize the substrate -catalyst interaction- Indeed, it 
has been found that A:U and U:A base pairs can be used at 
these positions in the synthetic autocatalytic catalyst 
of the invention. 

It is the ability to change the base pairs in 
Heiices 1 and 2 that allows the RNA catalyst to be 
engineered to cut a specific target RNA substrate having 
a cleavage sequence such as NGUC. This is illustrated in 
Example 16 where the catalytic RNA was engineered by 
changing base 214 from a G to a C resulting in a 
catalytic RNA which failed to react with the substrate 
RNA developed from natural (-)sTRSV RNA. Activity was 
restored, however, when the substrate RNA was changed so 
that it could base pair with the subject engineered 
catalytic RNA. Also see Examples 9, 17-21 and 32. 

A type of "hinge" region, consisting of a 
single base, seems to be present between Helices 2 and 3 
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of the (-)sTRSV catalytic RNA (see Figure 42D and Example 
32) . The A at position 15 is not paired to the U at 
position 49, and the unpaired A may act as a "hinge." 

Further mutation studies have showed that the 
substantially unpaired regions between Helices 3 and 4 
(Loops II and IV in Figure 42D) are larger than 
originally predicted by computer modeling and energy 
minimization (compare Figures 1 and 42D) . Further, some 
of the bases in these loops appear to be required for 
activity (see Example 32) . In particular, changing bases 
C25 and A43 results in a loss of activity (see Figure 42C 
and Example 32; also see Example 12) . 

The (-)sTRSV catalytic RNA sequence has an 
5'-AGAA-3' sequence opposite the AGUC cleavage sequence 
of the substrate. As shown in Examples 24 and 32, at 
least part of this AGAA sequence is invariant. In par- 
ticular, when the A's at the 3' and 5' ends of the 
sequence (&GAA) were changed, the resulting catalysts 
were active (see Figure 42A) . However, when G or A 
(AGAA) in the center of the sequence was changed, the 
resulting catalysts were inactive (see Figures 21 and 
42C) . 

The target RNA substrate of the "hairpin" 
catalytic complex shown in Fig. 1 has an AGUC cleavage 
loop which does not base pair to the catalytic RNA in 
two-dimensional space. As shown in Examples 13-15, "A" 
in the AGUC cleavage sequence can be changed to any other 
base without effecting the ability of the RNA catalyst to 
cleave the substrate. Example 25 shows that the GUC 
sequence is conserved. Thus, the only sequence require- 
ment for the cleavage sequence of the substrate RNA is 
GUC. 

Although there is no base pairing between the 
NGUC cleavage sequence and the AAGA sequence in the 
catalyst opposite from the cleavage sequence, the C in 
NGUC apparently interacts with the A in AAGA opposite to 
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it. In particular, it has been found that the single 
mutation of C-->A in the cleavage s quence NGUC 
(position 9 in the substrate sequence see Figure 42B) 
destroyed the ability of the RNA catalyst to cleave the 
substrate. When a second mutation was made in the 
ribozyme, changing the A at position 7 to a C, partial 
activity was restored (see Example 32) , indicating some 
sort of interaction between C at position 9 in the 
substrate and the A at position 7 in the catalyst (see 
Figure 42B) . 

The cleavage sequence has four flanking bases 
at its 5' end and six at its 3' end which base pair with 
the catalytic RNA, As described above, the bases in the 
flanking regions can be changed (with the possible 
exception of the G:C base pair in Helix 2 adjacent to 
NGUC) without affecting the ability of the catalytic RNA 
to cleave the substrate, as long as sufficient base 
pairing with the catalyst is maintained in the flanking 
regions. This would be expected to work on RNA substrate 
sequences of any length as long as these criteria are 
met. Indeed, lengthening the 3' region of the substrate 
that base pairs with the catalyst has been found to 
provide a more efficient catalytic reaction. See 
Examples 18, 21 and 32. However, a smaller 10 base sub- 
strate having three flanking bases at its 5' end and 
three flanking bases at its 3' end did not work as well 
as the 14-base substrate, as described in Example 9. 

Using the "hairpin" model as a guide, RNA 
catalysts can be engineered that base pair with an RNA 
substrate and mediate a cleavage in the cleavage se- 
quence. In particular, catalytic RNA can be engineered 
that will cleave any RNA substrate having a cleavage 
sequence, such as NGUC, and flanking regions with which 
the catalyst base pairs, so that the catalytic RNA and 
RNA substrate form a catalytic complex in a "hairpin" 
motif. To do this, the bases flanking the cleavage 
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sequence must be identified and the catalytic RNA engi- 
neered so that it does not pair in two-dimensional space 
with the cleavage sequence but does pair with adequate 
numbers of flanking bases upstream and downstream of the 
cleavage sequence. When designing a synthetic catalyst 
based on the (-)sTRSV catalyst, the other principles set 
forth herein regarding conserved or preferred sequences 
should also be taken into account in designing the 
synthetic catalyst. 

As shown in Examples 17-20 and 29, catalytic 
RKAs according to the invention can cleave specific viral 
and messenger RNA sequences. In Example 15, tobacco 
mosaic virus (TMV) RNA containing the 5' coding region of 
the replicase gene was targeted for specific cleavage by 
an appropriately engineered catalytic RNA. The target 
sequence contained changes in 8 of the 14 bases of the 
substrate RNA having a base sequence found within the 
catalytic complex of (-)sTRSV RNA and was cleaved by the 
engineered RNA catalyst under conditions near physio- 
logical. Catalytic RNAs were also designed and 
synthesized using the "hairpin" model as a guide which 
could cleave sequences from messenger RNA coding for 
chloramphenicol acetyl transferase (Example 18) and from 
HIV-l viral RNA (Examples 19 , 20 and 29) . In particular, 
a conserved sequence in HIV-l viral messenger RNAs has 
been identified, and a "hairpin" catalytic RNA designed 
which cleaves this sequence (see Example 29) . 

These examples demonstrate that the system can 
be used to specifically cleave an RNA sequence for which 
an appropriately engineered catalytic RNA base pairs at 
the designated flanking regions. Suitable target RNA 
substrates include viral, messenger, transfer, ribosomal, 
nuclear, organellar, other cellular RNA, or any other 
natural RNA having a cleavage sequence, as well as RNAs 
which have been engineered to contain an appropriate 
cleavage sequence. 
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Catalysts that fit the "hairpin" catalytic RNA 
model are useful in vivo in prokaryotes or eukaryotes of 
plant or animal origin for controlling viral infections 
or for regulating the expression of specific genes. In 
this case, a cleavage sequence such as NGUC in the virus 
or complementary to NGUC in the gene would need to be 
identified along with the flanking sequences immediately 
upstream and downstream of the cleavage sequence. Norm- 
ally three to four bases on the 5' side of the cleavage 
sequence and enough bases in the order of 6 to 12 on the 
3' side to provide adequate binding of the catalytic RNA 
and to provide reasonable certainty that the target RNA 
sequence is unique in the organism are required. 

A catalytic RNA is then engineered which does 
not base pair with the cleavage sequence and which does 
base pair to the flanking regions on the 5' and 3 f side 
of the cleavage sequence. A DNA template corresponding 
to this catalytic RNA is then synthesized using proced- 
ures that are well-known in the art. Such procedures in- 
clude the phosphoramidite method (see, e.g. . Beaucage and 
Caruthers, Tetrahedron Letters, 22, 1859 (1981); 
Matteucci and Caruthers, Tetrahedrqn Letters, 21, 719 
(19 80) ; and Matteucci and Caruthers, J. Amer . Chem. Soc. . 
103, 3185 (1981)) and the phosphotriester approach (see, 
e.g. , Ito et al., Nucleic Acids Res. , 2SL, 1755-69 
(1982)). 

The invention also includes an engineered DNA 
molecule and a vector comprising a DNA sequence coding 
for the desired synthetic RNA catalyst. The vector will 
have the DNA sequence coding for the desired catalytic 
RNA operatively linked to appropriate expression control 
sequences. Methods of effecting this operative linking, 
either before or after the DNA coding for the catalyst is 
inserted into the vector, are well known. Expression 
control sequences include promoters, activators, en- 
hancers, operators, stop signals, cap signals, poly- 
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adenylation signals, and other signals involved with the 
control of transcription. 

The vector must contain a promoter and a trans- 
cription termination signal, both operatively linked to 
the synthetic DNA sequence, i.e. , the promoter is up- 
stream of the synthetic DNA sequence and the termination 
signal is downstream from it. The promoter may be any 
DNA sequence that shows transcriptional activity in the 
host cell and may be derived from genes encoding homo- 
logous or heterologous proteins and either extracellular 
or intracellular proteins, such as amylase, glyco- 
amylases, proteases, lipases, cellulases, and glycolytic 
enzymes. Also, a promoter recognized by T7 RNA 
polymerase may be used if the host is also engineered to 
contain the gene coding for T7 RNA polymerase. 

The promoter may contain upstream or downstream 
activator and enhancer sequences. An operator sequence 
may also be included downstream of the promoter, if 
desired. 

Expression control sequences suitable for use 
in the invention are well known. They include those of 
the E . coli lac system, the E. coli trp system, the TAC 
system and the TRC system; the major operator and pro- 
moter regions of bacteriophage lambda; the control region 
of filamentous single- stranded DNA phages; the expression 
control sequences of other bacteria; promoters derived 
from genes coding for Saccharomyces cerevisiae TPI, ADH, 
PGK and alpha- factor; promoters derived from genes coding 
for the Aspergillus oryzae TAKA amylase and A. niaer 
glycoamylase, neutral alpha -amylase and acid stable 
alpha -amylase; promoters derived from genes coding for 
Rhizomucor miehei aspartic proteinase and lipase; mouse 
mammary tumor promoter; SV40 promoter; the actin pro- 
moter; and other sequences known to control the 
expression of genes of prokaryotic cells , eukaryotic 
cells, their viruses, or combinations thereof. 
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The vector must also contain one or more rep- 
lication systems which allow it to replicate in the host 
cells. In particular, when the host is a yeast, the 
vector should contain the yeast 2u replication genes 
REP1-3 and origin of replication. 

The vector should further include one or more 
restriction enzyme sites for inserting the DNA template 
sequences into the vector, and preferably contains a DNA 
sequence coding for a selectable or identifiable pheno- 
typic trait which is manifested when the vector is 
present in the host cell ("a selection marker") . 

Suitable vectors for use in the invention are 
well known. They include retroviral vectors, vaccinia 
vectors, pUC (such as pUC8 and pUC4K) , pBR (such as 
pBR322 and pBR328) , pTZ (such as pTZ18R) , pUR (such as 
pUR288) , phage lambda, YEp (such as YEp24) plasmids, and 
derivatives of these vectors. 

The resulting vector having the engineered DNA 
sequence that codes for the RNA catalyst is used to 
transform an appropriate host. This transformation may 
be performed using methods well known in the art. 

Any of a large number of available and well- 
known host cells may be used in the practice of this 
invention. The selection of a particular host is de- 
pendent upon a number of factors recognized by the art. 
These include, for example, compatibility with the chosen 
expression vector, toxicity to it of the catalytic RNA 
encoded for by the engineered DNA sequence, rate of 
transformation, expression characteristics, bio- safety 
and costs. A balance of these factors must be struck 
with the understanding that not all hosts may be equally 
effective for the expression of a particular catalytic 
RNA. 

Within these general guidelines, useful hosts 
include bacteria (such as E. coli sp.), yeast (such as 
Saccharomyces sp.) and other fungi, insects, plants, 
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animals (including human) , or other hosts known in the 
art. 

As an example of the general genetic engineer- 
ing techniques that are possible, mammalian vectors can 
be used to deliver DNA coding for the catalytic RNAs of 
the invention to animal cells. These vectors should have 
a suitable DNA replication signal, such as from SV40, a 
promoter which may or may not be inducible, such as the 
mouse mammary tumor promoter (which is induced by 
dexamethasone) or the noninducible SV40 promoter. A 
multiple cloning site is located after the promoter, and 
the DNA coding for the engineered catalytic RNA is 
inserted into this multiple cloning site using standard 
techniques . If necessary a suitable terminator is 
inserted. The resulting vector is then put into cells 
using standard techniques such as direct injection into 
the nucleus, electroporation, or other well -known trans- 
formation techniques. Once the vector is in the cell, 
the catalytic RNA is expressed directly when noninducible 
promoters are used, or after addition of the inducer when 
inducible promoters are used. 

Similarly, plant vectors, such as the Ti plas- 
mid or micro-Ti plasmids, can be used to deliver DNA 
coding for a desired catalytic RNA to plant cells. The 
Ti plasmids and micro-Ti plasmids may be used as such to 
transform plant protoplasts using known techniques or may 
be inserted into Aarobacterium tumefaciens which is then 
used to transform plant tissue. Once the plasmid is in 
the cell, the catalytic RNA will be expressed. 

Once transformed, a host cell can express 
(transcribe) the synthetic RNA catalyst. When the DNA 
coding for the catalyst is transcribed, it produces 
catalytic RNA which will attack and cleave the target RNA 
sequence or sequences for which it has been designed, 
inactivating the RNA. If the RNA is necessary for the 
life cycle of a virus, the virus will be eliminated and 
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if the RNA is the product of a specific gene, the 
expression of that gene will thus be regulated. The 
catalytic RNA can be designed to work in prokaryotes and 
within the nucleus (without poly (A) tail) or in the 
cytoplasm of a eukaryotic cell (with polyadenylation 
signals in place) for plants and animals. 

Another potential method of using the catalytic 
RNAs of the invention is to prepare stable synthetic 
derivatives of RNA catalysts designed to bind and cleave 
a specific target RNA and to deliver the modified cata- 
lysts directly to the cell or organism of choice. For 
example, standard methods are available for making 
phoshporothioate derivatives of DNA which have been shown 
to be very stable in vivo and to be able to bind to a 
specific DNA or RNA target in vivo (antisense method) . A 
modification of these procedures can be used to prepare a 
catalytically active derivative of RNA catalysts prepared 
according to the invention. This would entail determin- 
ing which ribonucleotide regions can be altered and then 
altering them with deoxy, phophorothio, or other modifi- 
cations which confer stability but do not destroy 
catalytic activity. This chemically modified catalytic 
RNA (which may or may not have any remaining RNA bonds) 
can then be injected or otherwise delivered to an 
organism to' control viruses or gene expression. For 
instance, one of the catalytic RNAs whose preparation is 
described in Examples 19-20 having specificity for se- 
quences found within the RNA of the HIV-1 virus that 
causes AIDS could be chemically modified as described, 
encapsulated in a liposome coated with monoclonal anti- 
body directed to the CD4 receptors found on cells suscep- 
tible to HIV-1, and injected into a host animal. 

The "hairpin" catalytic RNA model of the pre- 
sent invention may also be of possible interest to mole- 
cular biologists in exploring the genetic blueprints of 
plants and animals. This would be done by sending ran- 
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domly constructed DNA reverse transcripts of catalytic 
RNA into the DNA of the organism and waiting to see which 
gene or genes were inactivated. Other techniques could 
be applied to determine where those genes resided on the 
organism's chromosomes, thereby greatly accelerating gene 
mapping . 

Finally, a synthetic autocatalytic RNA catalyst 
has been developed- The synthesis of one such catalyst 
based on the (-)sTRSV RNA catalytic and substrate 
sequences is described in Example 23. When the DNA 
coding for this catalyst was transcribed in vitro , auto- 
catalytic cleavage was observed. Further, in Example 26, 
DNA coding for this catalyst was linked to the gene 
coding for chloramphenicol acetyl transferase (CAT) in 
vector pHC-CAT. This vector was then used to transform 
Escherichia coli . Northern blot analysis of the RNA 
produced by the E. coli hosts indicated that auto- 
catalytic cleavage occurred in vivo under standard 
E. coli growth conditions. Similar results were obtained 
when mammalian and plant cells were transformed with a 
vector coding for this autocatalytic RNA linked to either 
the gene coding for CAT (mammalian cells) or to DNA 
♦coding for cauliflower mosaic virus (plants) (see 
Examples 27 and 28) . The autocatalytic RNA catalyst has 
also been used to properly terminate other engineered RNA 
catalysts after they were transcribed. In this manner, 
the engineered RNA catalysts were liberated from the RNA 
transcript and acted in trans in vivo (see Example 31) . 

EXAMPLES 

The following examples further illustrate the 

invention. 

Example 1 

The (-) sense sequence of satellite RNA from 
the budblight strain of tobacco ringspot virus as shown 
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in Fig. 2 was folded using the Wisconsin RNA folding 
program to identify the location of a possible catalytic 
complex accounting for its ability to self cleave 
(University of Wisconsin Genetics Computer Group, Program 
FOLD 5/6/86) (Zucker, M. and Stiegler, P. (1981) Nucleic 
Acids Res. . £: 133-148; Devereux, J., Haeberli, P. and 
Smithies, 0. (1984) Nucleic Acids Res. . 12: 387-395). 
Base numbers correspond to (+)sTRSV RNA (Buzayan f J. M. , 
Gerlach, W. L., Bruening, G. B., Keese, P. and Gould, A. 
R. (1986) Virology . 151 : 186-199). With this numbering 
scheme the 5' -3' direction of the molecule is with 
decreasing base number. 

The minimum catalytic complex, or active site 
of the molecule, is identified in Figure 2. The folding 
identified regions of expected base pairing which are in 
classical double-helical or stem regions. The folding 
also identified expected non-base pairing loops at or 
near the site of cleavage. This model does not preclude 
higher order interactions occurring between non-adjacent 
portions of the catalytic center. 

Example 2 

A 50 nucleotide sequence between bases 175 and 
224 was picked and a 14 nucleotide sequence between bases 
40 and 53 was picked from the catalytic complex identi- 
fied in Example 1. A catalytic RNA (R51) with the 50 
base sequence shown in Fig. 1 plus one additional vector 
base (G at the 5' end) and a substrate RNA (S17) with the 
14 base sequence shown in Fig. 1 plus three vector bases 
(GCG at the 5' end) were transcribed using T7 RNA 
polymerase from synthetic DNA templates double stranded 
at the promoter site (Milligan, J. F. , Groebe, D.R., 
Witherell, G. W. and Uhlenbeck, 0. C. (1987) Nuc. Acids 
Res. , 15 : 8783-8798). The synthetic DNA templates were 
made using phosphoramidite chemistry on an Applied 
Biosystems 381A DNA synthesizer. The template DNAs were: 
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catalytic RNA R51: 3' -ATTATGCTGAGTGATATCTTTGTCTCTTCAGTT- 
GGTCTCTTTGTGTGCAACACCATATAATGGACCAT-5' and substrate RNA 
S17 : 3 ' - ATTATGCTGAGTGATATCGCACTGTCAGGACAAA- 5 * . 

Before transcription, a 18mer or 16mer DNA 
complement to the promoter for T7 RNA polymerase on the 
noncoding strand was hybridized by heating an equimolar 
amount of template DNA with promoter complement to 65 °C 
for 3 min. then placing in ice. A typical transcription 
reaction used 8ng/ul DNA template, 0.5mM each NTP, 2mM 
spermidine, 40mM Tris pH 7.5, 4% polyethylene glycol 
6,000, 6mM MgCl 2 , 4mM NaCl, lOmM dithiothreitol , 0.01% 
Triton X-100, 2.4 units/ul RNasin, 1.8uCi/ul P 32 CTP and 3 
units/ul T7 RNA polymerase (US Biochemical) and was run 
at 37°C for 90 min. 

All in vitro transcribed RNAs were isolated on 
7M urea, 15-20% acrylamide gels, bands cut out and 
isolated. All RNAs were sequenced using standard methods 
(Donis-Keller, H., Maxam, A. M. and Gilbert, W. (1980) 
Nucleic Acids Res. . 4: 2527-2538); a method which also 
gave the 5' terminal base. Terminal bases at the 3' end 
were determined by ligation of the RNA to 5' P 32 pCp using 
T4 RNA ligase (BRL methods manual) , nuclease T2 
digestion, and separation of labelled bases by PEI thin 
layer chromatography in 0.3M LiCl with appropriate stan- 
dards. All RNA sequences corresponded to that expected 
from the DNA template. 

Example 3 

The catalytic RNA R51 was added to the sub- 
strate RNA S17 at a ratio of 1:30 and the time course of 
substrate RNA cleavage was studied. The reaction was 
carried out at 37°C in 12mM MgCl 2 , 40mM Tris pH 7.5 and 
2mM spermidine over a time period of 150 min and is 
summarized as follows: 

5'gcgUGACAGUCCUGUUU 3' > gcgUGUCA + GUCCUGUUU 

(S17) (5'F) (3'F) 
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The reaction products were separated on poly- 
acrylamide/urea gels by electrophoresis and bands cut out 
and counted in a liquid scintillation counter. The 
results are shown in Fig. 3. The time periods analyzed 
were: lane 1, 30 sec; lane 2, 5 min; lane 3, 15 min; 
lane 4, 30 min; lane 5, 60 min; lane 6, 90 min and lane 
7, 150 min. Beginning concentrations were as follows: 
R51*0.0032uM and S17«0.09uM. 

As shown in Fig- 3, the cleavage proceeds to 
virtual completion during the course of the reaction with 
only 2% of the substrate remaining after 150 minutes. 
This shows that, since there was originally 30 times as 
much substrate RNA as catalytic RNA, the RNA catalyst R51 
of necessity interacts with multiple substrate molecules 
during the course of the reaction. In addition, the 
amount of catalyst remained the same and was unaltered, 
indicating that R51 is truly a catalytic entity. 

Example 4 

After the RNA catalyst had been shown to be 
effective in cleaving the RNA substrate as described in 
Example 3, minimum energy folding of the 50 base sequence 
shown in Fig. 1, complexed with the 14 base sequence was 
done using the computer methods described in Example 1. 
The folded complex forms a "hairpin" model or motif as 
shown in Fig. 1 with the substrate RNA sequence and the 
catalytic RNA sequence identified. The arrow is at the 
site of cleavage. 

Example 5 

Various concentrations of substrate S17 were 
used at* constant concentration of catalyst R51 and ini- 
tial velocities of each reaction determined. The reac- 
tion was carried out at 37 °C in 12mM MgCl 2 , 40mM Tris pH 
7.5 and 2mM spermidine. Concentrations were as follows: 
R51=0.0004uM and S17=0.125uM (lane 1), 0.0624uM (lane 2), 



- 48 - 



. 49 - 

0.0417uM (lane 3), 0.031uM (lane 4), 0.021uM (lane 5), 
0.0l56uM (lane 6), 0.0078uM (lane 7) and 0.0039uM (lane 
8) . Each reaction was analyzed on polyacrylamide gels as 
described in Example 3 with the results shown in Fig. 4. 
An Eadie Hofstee plot of catalytic RNA R51 cleavage of 
substrate RNA S17 is shown in Fig. 4. The reaction 
proceeded according to the predictions of the Michaelis- 
Menten equation indicating that it was a truly enzymatic 
reaction in that as the concentration of the substrate 
goes down, the velocity of the reaction goes down. From 
the velocity of the reaction plotted as a function of 
substrate concentration, the Km calculated from the graph 
was 0.03uM and the kcat (turnover number) was 2.1/min. 
This Km is 20 times smaller than the "hammerhead" 
catalysts (see Uhlenbeck, 0. C. (1987) Nature, 328 : 596- 
600) indicating that lower concentrations of substrate 
can be removed. The kcat is 6 times larger than that of 
catalysts fitting the "hammerhead" model at: 37°C (see 
Uhlenbeck, 0. C. (1987) Nature , 32 8 : 596-S00), meaning 
that the reaction is 6 times faster. 

Example 6 

The temperature dependence of the rate of 
cleavage of substrate RNA S17 by catalytic RNA R51 was 
tested over a temperature range and the reaction products 
analyzed on polyacrylamide gels as described in Example 3 
with the results shown in Fig. 5. The reaction was 
carried out in 12mM MgCl 2 , 40mM Tris pH 7.5 and 2mM 
spermidine at 45°C (lane 6), 41°C (lane 5), 37°C (lane 4), 
33°C (lane 3), 27°C (lane 2) and 20°C (lane 1). The 
concentrations used were: R51=0.0016uM and S17«0. 04uM. 
R51 was unlabeled. The velocities shown in the graph in 
Figure 5 were calculated by the use of time points of 8 
and 16 minutes. The separation patterns on the gel shown 
in Figure 5 are for the 16 -minute time point. 
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The reaction showed a temperature dependence 
similar to that which would be expected of a reaction 
involving base paired RNA molecules. The Arrhenius plot 
of the data shown in Fig. 5 gives a temperature optimum 
of 37 °C for the reaction. Higher temperatures reduce the 
reaction rate with a very rapid rate reduction above 41 °C 
consistent with a melting out of the catalytic RNA struc- 
ture. At 50 °C no reaction was detectable. The reaction 
rate at temperatures below 3 7°C showed a linear 
reciprocal temperature dependence consistent with a 
classical lowering of the energy of activation for the 
reaction. The slope of the line in the Arrhenius plot 
gave an energy of activation of 19 Kcal/mole which is 
close to that found for catalysts fitting the 
"hammerhead" cleavage mechanism (13.1 Kcal/mole) 
(Uhlenbeck, O. C. (1987) Nature . 328 : 596-600). 

Example 7 

The rate of cleavage of a constant concentra- 
tion of substrate RNA S17 at varying concentrations of 
catalytic RNA R51 was tested and the reaction products 
analyzed on polyacrylamide gels as described in Example 3 
with the results shown in Fig. 6. The reaction was 
carried out at 37°C in 12mM MgCl 2 , 40mM Tris pH 7.5 and 
2mM spermidine for 4 0 min (lane 1 and 2) , 20 min (lane 
3), 10 min (lane 4) and 5 min (lane 5). The concentra- 
tion of substrate was 0.175uM. The results are plotted 
in Fig. 6 and show that at saturating concentrations of 
substrate the reaction rate is linear with increasing RNA 
catalyst concentrations as one would expect for a true 
catalytic reaction. 

Example 8 

The effect of Mg ++ concentration and pH on the 
rate of cleavage of RNA substrate S17 by RNA catalyst R51 
was determined as shown in the following table: 
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£, 1Q (min) 

no detectable product 
136 
111 
115 
88 
81 • 
74 
62 

£ 1Q (min) 

330 
120 
67 
48 
42 
38 

In the Mg ++ studies, the substrate S17 con- 
centration was 0.l4uM and RNA catalyst R51 concentration 
was 0.0015uM. The reactions were at 37°C in 40mM Tris pH 
7.5. In the pH studies, the substrate S17 concentration 
was 0.062uM and RNA catalyst R51 concentration was 
0.0014UM. The reactions were at 37°C in 40mM Tris for pH 
7.0, 7.5, 8.0 and in 40iriM Pipes for pH 5.5, 6.0 and 6.5. 

The dependence of the reaction rate on Mg ++ and 
pH are virtually identical with those of catalysts 
fitting the "hammerhead" model. The reaction rate in- 
creases with increasing pH as one would expect for a base 
catalyzed reaction but the effect is masked by the 
catalytic activity of the RNA. Hence a 100 fold increase 
in [OH"] between pH 6.0 and 8.0 resulted in only a 3 fold 
increase in the reaction rate. 

Example 9 

A 10 base substrate (S10) was prepared by the 
methodology of Example 2 . When the substrate was mixed 
with catalytic RNA R51, the reaction is summarized as 
follows: 
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5'GACAGUCCUG 3' M ^ GACA + GUCCUG 

(S10) (5'F) (3'F) 

The results of rate studies with substrate S10 
comparable to those described with S17 in Example 5 
showed a Km=0.06uM and a kcat=0 . 8/min. These results are 
shown in Fig. 7 and indicate that smaller substrates can 
be used, but not as efficiently. 

Example 10 

The 3' terminal base of the catalyst shown in 
the "hairpin" model of the (-)sTRSV catalytic complex in 
Fig, l is at position 175. Two catalytic RNAs were 
prepared with exactly the same sequence as R51, except 
that one of them did not contain the 3' terminal "A" base 
(position 175} and the other one did not contain the 3' 
terminal "UA" sequence (positions 176 and 175) . 
Synthesis of these catalytic RNAs , designated R50 and 
R49, respectively, was carried out as described in 
Example 2. R50 or R49 catalytic RNA was mixed with 
substrate RNA S17 under standard conditions of reaction 
and the products analyzed as described in Example 3. The 
results are given in Fig. 8 and show a 75% reduction in 
activity with either R50 or R49 as compared to the 
activity of catalytic RNA R51 having the 3' terminal "A" 
at position 175 and the 3' terminal "AU" at positions 175 
and 176. 

Example 11 

An RNA with the same sequence as catalytic RNA 
R51 was prepared, except that bases 195-203 were omitted 
such that base 194 was in effect ligated to base 204. 
This RNA molecule was prepared as described in Example 2 
from an appropriate DNA template containing the com- 
plementary sequence. When this RNA was mixed with sub- 
strate RNA S17 as described in Example 3, no reaction 
occurred. These results show that major elements of the 
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"hairpin" structure are required for RNA catalysis to 
occur and that removal of 9 bases (see Pig. 9) in the 
middle inactivates the catalytic RNA. 

Example 12 

An RNA with the same sequence as R51, except 
that the bases AAA at positions 203, 202 and 201 were 
changed to CGU, respectively, was prepared as described 
in Example 2 using an appropriate DNA primer. When this 
potential RNA catalyst was mixed with substrate RNA S17 
as described in Example 3, no reaction occurred. This 
shows that the integrity of one or all of these bases 
(see Fig. 10) is required for catalytic activity. 

Example 13 

A substrate RNA with the base at position 49 in 
Fig. 1 changed from an "A" to a "G" was prepared as de- 
scribed in Example 2. When this substrate was reacted 
with the RNA catalyst R51, no difference in rate of 
reaction was seen between this substrate and the sub- 
strate containing the "A" at position 49 (see Fig. 11A) . 
This shows that alterations can occur in the "A" base in 
the substrate RNA AGUC loop without affecting the ability 
of the catalytic RNA to cleave the substrate. 

Example 14 

Another substrate RNA identical to S17 but 
having "A" replaced by "U" in the AGUC loop was prepared 
as described in Example 2 (designated "S17 (A- - >U) " ) . 
This substrate RNA, at a concentration of 0.12uM, was 
reacted with the catalytic RNA R51, at a concentration of 
0.0065uM, under standard conditions as described in 
Example 3 for 60 minutes. The results are shown in 
Figure 11B where Lane 1 contains the products of the 
reaction of substrate RNA S17(A-->U) with R51 catalytic 
RNA. No difference in the rate of reaction was seen 
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between S17(A-->U) substrate RNA and substrate S17 
containing the "A" base at position 49. 

Example 15 

Another substrate RNA identical to S17 but 
having "A" replaced by B C n in the AGUC loop was prepared 
as described in Example 2 (designated "S17 (A- - >C) n ) . 
This substrate RNA, at a concentration of 0.08uM, was re- 
acted with the catalytic RNA R51, at a concentration of 
0.0065uM, under standard conditions as described in 
Example 3 for 60 minutes. The results are shown in 
Figure 11C where Lane 1 contains the products of the 
reaction of S17(A-->C) substrate RNA with R51 catalytic 
RNA. Again, no difference was seen in the rate of re- 
action using S17(A-->C) as compared to S17 containing the 
n A" base at position 49. The combined results of 
Examples 13-15 show that the base at position 49 in the 
cleavage sequence of the substrate may be any base. 

Example 16 

Base changes in the stem regions at the site of 
binding of the substrate RNA to the catalytic RNA in the 
"hairpin" configuration can be made as long as the base 
pairing is maintained. The n C n base at position 50 of 
the substrate was changed to a "G" using the methods in 
Example 2 . When this substrate RNA was reacted with the 
catalytic RNA R51, no cleavage of this substrate 
occurred. However, when a new catalytic RNA, containing 
a "C" at position 214, rather than the n G" found in R51, 
was synthesized according to the methods in Example 2 and 
added to this substrate, full cleavage was seen. The 
effect of the base change from "C n to n G n in the sub- 
strate was to eliminate the base pairing at this position 
predicted by the "hairpin" model since now a "G" would be 
across from a "G". However, when a "G n to "C" base 
change was made in the catalytic RNA, the base pairing 
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was restored, but in a reverse manner, and the integrity 
of the helices in the stem regions where the substrate 
RNA binds to the catalytic RNA was thus conserved restor- 
ing catalytic activity (see Pig. 12) . 

Example 17 

An RNA sequence found within the sequence of 
tobacco mosaic virus was synthesized using the methods 
described in Example 2. This synthesized target RNA had 
the sequence 5 ' gAAACAGUCCCCAAC 3'. A catalytic RNA was 
synthesized with the sequence 5 ' -GUUGGGAGAAGUUUACCA- 
GAGAAACACACGUUGUGGUAUAUUACCUGGUA- 3 ' selected so that base 
pairing between the substrate and the catalytic RNA is 
maintained in the "hairpin" configuration (see Fig. 13) . 
When these two RNAs were mixed under standard catalytic 
conditions as described in Example 3, the target was 
cleaved demonstrating that a sequence found within a 
native viral RNA can be cleaved. 

Example 18 

Three RNA sequences found within the sequence 
of the messenger RNA for the enzyme chloramphenicol 
acetyl transferase (CAT) were synthesized using the meth- 
ods described in Example 2. The synthesized substrate 
RNAs had the sequences (A) gUUUCAGUCAGUUGC ; (B) 
gUUUCAGUCAGUUGCUC ; and (C) gggUUUCAGUCAGUUGCUCAA (see 
Fig. 14) . 

Note that the latter two sequences are ex- 
tensions of the first sequence and that additional bases 
have been added to the 3' end in the region that the 
"hairpin" model predicts will base pair with the cataly- 
tic RNA to form Helix 1 (see Figure 18)-. Also, substrate 
(C) had two additional G vector bases as compared to sub- 
strates (A) and (B) . The site of cleavage after the A in 
the AGUC cleavage sequence (see the arrow in Figure 14) 
of the substrates corresponds to position 320 of the CAT 
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gene using the number system found in the Tn9 sequence 
(Alton and Vapnak, Nature . 282 . 864 (1979)). In Fig- 
ure 14, the open boxed bases are those which are dif- 
ferent from those in the native (-)sTRSV substrate RNA 
sequence shown in Figure 1. 

Catalytic RNAs corresponding to substrate RNAs 
(A) , (B) and (C) were synthesized according to the 
methods described in Example 2. Their sequences were 
designed so that they would base pair with the substrate 
RNAs in both the 3' and 5' regions flanking the AGUC 
cleavage sequence. In addition, the catalytic RNAs 
designed to react with substrate RNAs (A) and (B) each 
contained the vector sequence GA at their 5' terminus, 
and the catalytic RNA designed to react with substrate 
RNA (C) contained the vector sequence GGG at its 5' 
terminus. Otherwise, the catalytic RNAs had the same 
sequence as the (-)sTRSV catalytic RNA sequence shown in 
Figure 1. 

The various substrate and catalytic RNAs were 
reacted and the reaction products analyzed as described 
in Example 3 . All reaction conditions were as described 
in Example 3, except for the following. For substrates 
(A) and (B) , reaction conditions were: substrate RNA 
concentration = O.OSuM; catalytic RNA concentration = 
0.005uM; reaction run at 16 °C; and reaction time of 20 
minutes. For substrate (C) , the reaction conditions were 
the same as for (A) and (B) , except that the reaction 
time was 40 minutes and temperatures were 20°C, 25°C, 
30°C and 37°C. 

Cleavage of all of the substrate RNAs occurred 
when they were mixed with the corresponding catalytic 
RNAs as is shown in Figure 14, demonstrating that cataly- 
tic RNAs according to the invention can be synthesized 
which cleave specific RNA sequences found within a mes- 
senger RNA. In addition, this example demonstrates that 
cleavage of substrate RNA can be obtained even though the 
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length of the region at the 3' end of the substrate which 
base pairs with the catalyst ( i.e. . the portion that 
forms Helix 1 with the substrate according to the 
"hairpin" model) is varied. Indeed, when the length of 
this region was extended to 10 bases in substrate (C) , 
the reaction could then proceed at 37 °C, whereas for 
substrates (A) and (B) having shorter sequences in this 
region, the reaction would proceed only at 16 °C. 

Example 19 

An RNA substrate corresponding to part of the 
sequence of HIV-l, the virus which causes AIDS, was 
synthesized as described in Example 2. The sequence of 
this substrate RNA is shown in Figure 15. The arrow in 
Figure 15 shows the cleavage site which corresponds to 
position 804 in the sequence of HIV-l strain SF2CG 
(Sanchez -Pescador et al., Science . 227 . 484 (1985). The 
sequence shown is found in the region of the viral RNA 
which specifies the gag protein. The RNA substrate also 
had a GCG 5' vector sequence. The open boxed bases in 
Figure 15 are those which are different than those of the 
native (-)sTRSV substrate sequence shown in Figure 1. 

A catalytic RNA was synthesized according to 
the methods of Example 2 . Its sequence was designed so 
that it would base pair with the substrate RNA in both 
the 3* and 5' regions flanking the CGUC cleavage 
sequence. In addition, the catalytic RNA contained the 
vector sequence GGG at its 5' terminus. Otherwise, the 
catalytic RNA had the same sequence as the (-)sTRSV 
catalytic RNA sequence shown in Figure 1. 

The catalytic RNA and the substrate RNA were 
reacted and the reaction products were analyzed as de- 
scribed in Example 3. The reaction conditions were as 
set forth in Example 3 f except that the following temp- 
eratures were used: 20°C, 25°C, 30°C and 37°C. Also, 
the reaction was run for 60 minutes, except: for 37°C which 
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was run for 15 minutes, and the substrate RNA concentra- 
tion was 50nM, and the catalytic RNA concentration was 
5nM. 

The catalytic RNA cleaved the substrate at the 
expected position between the "C" and "G" in the CGVC 
cleavage sequence found in the loop between the two 
flanking stem regions. Thus, a specific sequence found 
in the HXV-1 viral RNA that codes for the gag protein can 
be cleaved with a catalytic RNA according to the 
invention. The reaction occurred with an RNA substrate 
having a 16 -base target sequence which was longer than 
S17 in the region at the 3' end which base pairs with the 
catalyst ( i.e. . the portion that forms Helix 1 with the 
catalytic RNA according to the "hairpin" model) . Also, 
the reaction occurred at physiological temperature of 
37°C. 

Example 2Q 

A substrate RNA having a sequence found in the 
beginning of the coding region for the regulatory protein 
tat of HIV-l virus was synthesized as described in 
Example 2. The substrate sequence is shown in Figure 16. 
In addition to the HIV- derived sequence, the substrate 
RNA had a GCG 5' vector sequence. The open boxes are 
around bases which are different than those of the native 
(-)sTRSV substrate sequence shown in Figure 1. 

A catalytic RNA having a sequence so that it 
would base pair with the substrate RNA in the two regions 
flanking the UGUC loop ( i.e. . the regions that forms 
Helices 1 and 2 with the catalytic RNA according to the 
"hairpin" model) was also synthesized as described in 
Example 2. Otherwise, the catalytic RNAs had the same 
sequence as the (-)sTRSV catalytic RNA sequence shown in 
Figure 1, except that it had an additional 5'G vector 
base. 
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The substrate RNA and catalytic RNA were re- 
acted and the reaction products were analyzed as de- 
scribed in Example 3. Reaction conditions were: 37°C; 
reaction times of zero and 15 minutes; the concentration 
of substrate RNA was lOOnM; and the concentration of 
catalytic RNA was 20nM. 

Cleavage occurred as expected between the "U" 
and the "G n in the UGUC cleavage sequence located between 
the two stem regions of the substrate. The cleavage site 
is indicated by the arrow in Figure 16- This is position 
5366 in the sequence of HIV clone h9c (Muesing et al., 
Nature . 313 . 450 (1985)}. These results again confirm 
that a catalytic RNA designed according to the "hairpin" 
model can cleave a specific target sequence located in a 
naturally occurring RNA, in this case a key regulatory 
region (tat) of the HIV-1 virus which causes AIDS. 

Example 21 

Using the methods described in Example 2, a 
substrate RNA having four non-native bases (UUUU) added 
to the 3' end of the sequence of the native (-)sTRSV 
substrate shown in Figure 1 and a corresponding catalytic 
RNA designed to base-pair with the substrate RNA in the 
3' and 5' regions of the substrate flanking the cleavage 
sequence ( i.e. . the portions that form Helices 1 and 2 
with the catalyst according to the "hairpin" model) were 
made. Thus, the substrate RNA contained a total of 18 
bases, including a four-base extension of the 3' region 
that base pairs with the catalyst. The substrate RNA 
also had an additional GCG vector sequence at the 5' end. 
The catalyst had the same sequence as the ( - ) sTRSV 
catalytic RNA sequence shown in Figure 1, except that it 
had four additional AAAA bases at the 5 'end designed to 
base pair with the added UUUU bases of the substrate and 
had an additional "G" vector base at the 5' terminus 
beyond the 18 base recognition site. 
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The substrate and catalytic RNAs were reacted 
at standard conditions and the reaction products were 
analyzed as described in Examples 3 and 5. Catalytic RNA 
concentration was 0.00033uM, and substrate RNA con- 
centration was O.luM (Lane 1), 0.05uM (Lane 2), 0.033uM 
(Lane 3), 0.025uM (Lane 4), 0.016uM (Lane 5), 0.012uM 
(Lane 6), 0.006uM (Lane 7) , and 0.003uM (Lane 8). The 
results are shown in Figure 17. Cleavage rates at each 
concentration of substrate were determined by cutting out 
the bands, counting radioactivity in a liquid scintilla- 
tion counter and plotting the data using Michael is -Men ton 
procedures to calculate Km and kcat (see Example 5) . 

The data show that an extension of the length 
of the region of base pairing between the substrate and 
catalyst (i.e., those regions of the catalyst and sub- 
strate that form Helix 1 according to the "hairpin" 
model) can improve the catalytic properties of the re- 
action. Cleavage of the 18-base RNA substrate occurred 
at the expected site, but at an increased rate as com- 
pared to the cleavage of S17 by R51. The kcat or turn- 
over number of the reaction was 7/minute. This means 
that each molecule of catalytic RNA cleaved 7 molecules 
of substrate RNA per minute during the reaction. The 
kcat for S17 cleavage by R51 was 2,1/minute. The Km of 
the reaction was the same as for S17 cleavage by R51 
(30nM) . This shows that by optimizing the length of the 
region of the catalyst that base pairs to the substrate 
in the 3' region flanking the cleavage sequence (i.e. . by 
optimizing the length of Helix 1 predicted by the 
"hairpin" model), the catalytic properties of the native 
reaction can be improved. 

Example 22 

A series of catalytic RNAs were prepared using 
the methods described in Example 2 having certain base 
substitutions as compared to the native (-)sTRSV cataly- 
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tic RNA sequence shown in Figure 1. The substitutions, 
which are illustrated in Figure 19, were the following: 
(1) At nucleotide 35, G was replaced by C (G35-->C) 
(count bases from the 5' end of the catalytic RNA se- 
quence shown) ; (2) A double mutant, having the G at 
position 35 replaced by C and the C at position 27 re- 
placed by G (G35-->C; C27-->G); (3) At nucleotide 47, G 
was replaced by C (G47-->C); and (4) A double mutant 
having the G at position 47 replaced by C and the C at 
position 17 replaced by G (G47-->C; C17-->G) . All 
catalytic RNAs had an additional n G n vector base at their 
5' end. 

The various catalytic RNAs were reacted with 
substrate S17 and the reaction products were analyzed as 
described in Example 3. The results are shown in Figure 
19 where Lanes 1, 3, 5, 7, 9 and 11 are at zero time, and 
Lanes 2, 4, 6, 8, 10 and 12 are 15 minutes incubation 
under standard cleavage conditions (see Example 3) • The 
concentration of catalytic RNA was 0.0065uM, and the con- 
centration of substrate RNA was 0.17uM. The temperature 
was 37°C, The control, lanes 1, 2, 7 and 8, was cleavage 
of the native substrate RNA S17 by catalytic RNA R51 
which has the native (-)sTRSV sequence (see Figure 1). 

As shown in Figure 19, the catalytic RNA 
G35-->C had no catalytic activity (see Lanes 3 and 4 of 
Figure 19, where this catalytic RNA is designated "mis- 
match" since the base substitution at position at posi- 
tion 35 results in a loss of base pairing) . The double 
mutant catalytic RNA G35-->C; C27-->G showed restored 
catalytic activity (see Lanes 5 and 6 of Figure 19 where 
this catalytic RNA is designated "substitute b.p." since 
the net effect of the two base substitutions is to create 
a base pair, but a base pair different than the one found 
in the native (-)sTRSV catalytic RNA). The catalytic RNA 
G47-->C was also inactive (see Lanes 9 and 10 of Figure 
19), while the double mutant, with the second mutation 
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C17-->G, showed restored activity (see Lanes 11 and 12 of 
Figure 19) . 

The results of these mutagenesis studies pro- 
vide confirmation for the existence of Helices 3 and 4 
(see Figure 19) of the "hairpin" catalytic RNA model 
proposed herein- The results show that when mutagenesis 
was carried out which caused a base pair mismatch in the 
region of proposed base pairing, the catalytic RNA was 
inactive- However, when a second mutation was carried 
out so that the base pair was restored, catalytic 
activity was restored, even though the new base pair was 
different than the original base pair. Such results are 
considered evidence for the existence of a helix (Fox and 
Woese, Nature , 256 . 505 (1975). These results also show 
that a variety of catalytic RNAs having sequences dif- 
ferent from the native (-)sTRSV sequence are catalyti- 
cally active if they are designed so that they follow 
certain rules derived from the "hairpin" model, such as 
maintenance of substantial base pairing in regions of 
predicted helices. 

Example 23 

A synthetic "hairpin" autocatalyt ic cassette 
was prepared. The RNA sequence of the cassette is shown 
in Figure 20. Several bases have been added at the 5' 
end of the catalyst as compared to the native (-)sTRSV 
sequence shown in Figure 1. The effect is to close the 
open end of the "hairpin" and to provide a substrate 
sequence ( i.e. . a cleavage sequence and upstream and 
downstream flanking regions) which can base pair with the 
substrate binding portion of the catalyst. 

The cassette was prepared by making an appro- 
priate synthetic DNA template that would yield an RNA 
with the sequence shown in Figure 20 and using the DNA 
template to transcribe RNA as described in Example 2, 
with the following differences. After synthesizing the 
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DNA template, it was inserted into the vector pTZl8R (US 
Biochemical) into which a new multiple cloning site had 
been inserted. The new multiple cloning site was a 
construct containing sites, in 5'->3' order, for cleavage 
by the following enzymes: BamH I. Xhol, At>a l. £acll, 
Nae l. Stul. Kpn I. Mlul. The new multiple cloning site 
was inserted into vector pTZ18R by cleaving the vector 
with BamHI and Sai l and then ligating the multiple clon- 
ing site to the vector using T4 ligase. Located 5' to 
the inserted multiple cloning site is the T7 RNA poly- 
merase promoter. The vector containing the multiple 
cloning site was digested with Mlu l and Sai l, and the DNA 
template coding for the autocatalytic cassette was 
ligated into the vector using T4 ligase. The resultant 
vector was then linearized with Hin dlll. and transcrip- 
tion carried out as described in Example 2. All res- 
triction enzymes and the T4 ligase were obtained from IBI 
and used according to manufacturer's instructions. 

After being transcribed, the cassette sponta- 
neously underwent an intramolecular autocatalytic cleav- 
age at the expected site to give the appropriate 3'F and 
5'F products (see Figure 20). Note that the effect of 
this is to autocatalytically terminate a transcript. For 
example, the 5'F is in itself a transcript which has been 
terminated at its 3' end by the autocatalytic reaction. 
Further note that this termination only leaves five 
essential bases at the 3' end of this 5'F. These are 
UGACA which are boxed in Figure 20. Thus, it is possible 
to very efficiently terminate transcription with this 
autocatalytic cassette and leave only a very short 3' end 
to the transcript ( i.e. . the 5' fragment of the auto- 
catalytic cleavage) . 

Example 24 

A catalytic RNA was prepared as described in 
Example 2. Its sequence was identical to that of R51, 
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except that the base at position 217 in the AGAA loop was 
changed from a G to a C. The AGAA loop of the catalyst 
is opposite the cleavage sequence of the substrate when 
the substrate and catalyst are complexed (see Fig. 1) 

Another catalytic RNA was prepared as described 
in Example 2. Its sequence was also identical to R51, 
except that the base at position 216 was changed from A 
to U. 

These catalysts were reacted with the RNA 
substrate S17 under standard conditions as described in 
Example 3 at 37°C for 0 and 15 minutes- The control was 
the reaction of substrate S17 with catalyst R51. The 
concentration of the three catalytic RNAs was 0.007uM, 
and the concentration of substrate RNA was 0.7uM. 

The reaction products were analyzed as de- 
scribed in Example 3, and the results are shown in Figure 
21, where the first lane in each gel is zero time and the 
second lane is 15 minutes of reaction. As shown in 
Figure 21, changes in either one of these two bases 
(G217-->C and A216-->U) in the loop opposite the cleavage 
sequence of the substrate destroyed the activity of the 
catalyst. These results indicate that these two bases 
are likely invariant in the native (-)sTRSV catalytic 
sequence. 

A third catalytic RNA was prepared as described 
in Example 2. Its sequence was identical to R51, except 
that the base at position 218 was changed from A to C. 
This catalyst was also reacted with substrate S17 as 
described above. Cleavage of the substrate with this 
catalyst was observed, but only at about 47% of the level 
achieved with R51 (data not shown) . 

Example 25 

Substrate RNAs identical to S17, but with one 
of the bases at positions 46, 47 or 48 (i.e. , bases GUC 
of the cleavage sequence) changed to a different base. 
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were prepared as described in Example 2. When these 
substrates were reacted with the RNA catalyst R51 under 
standard conditions for 60 minutes as described in 
Example 3, no cleavage of the substrates occurred. This 
shows conservation of the GUC sequence of the cleavage 
sequence of (-)sTRSV RNA. The results of these experi- 
ments combined with the results of Examples 13-15 show 
that the cleavage sequence of (-)sTRSV RNA is NGUC, where 
N is any base. 

Example 2$ 

The vector prepared in Example 23 (hereinafter 
referred to as "pHC") containing the "hairpin" auto- 
catalytic cassette wa£ tested for activity in vivo as 
follows. First, the CAT gene was removed from plasmid 
pMAMNEO - CAT (purchased f rom Clontech Inc.) with Smal and 
Xho l. It was then ligated using T4 ligase into pHC which 
had been cut with Sma l and Xho l to produce vector pHC- 
CAT. 

The original vector used in these constructions 
(pTZ18R; see Example 23) contains an inducible promotor 
(lacZ) and, as a result of the steps described in Example 
23 and immediately above in this example, the CAT gene 
and the "hairpin" autocatalytic cassette were placed in 
this inducible region (see Figure 22A) . Also, the CAT 
gene and "hairpin" autocatalytic cassette were linked so 
that the expected transcript would be as shown in Figure 
22A (the "CAT- cassette RNA") and so that ttie RNA sequence 
coded for by the "hairpin" autocatalytic cassette would 
be expected to serve as a chain terminator for the CAT 
transcript by cleaving at the indicated cleavage site 
(see Figure 22A) . The expected 5' fragment of such a 
cleavage is also shown in Figure 22A ("Cleaved CAT- 
cassette RNA") . 

Next, pHC-CAT was trans fee ted into Escherichia 
coli strain JM109 (widely available from a number of 
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commercial sources and from the American Type Culture 
Collection) with calcium chloride and heat shock using 
standard procedures as described in Maniatis et al., 
Molecular Cloning (1983) . Transformed cells containing 
pHC-CAT were selected on the basis of ampicillin 
resistance by plating on double concentration YT medium 
containing 100 ug/ml ampicillin. After selection, the 
transformed cells were grown overnight in LB broth at 
37°C. A fresh culture of these cells was then grown for 5 
hours in LB broth at 37°C, after which the cells were 
induced with ImM isopropyl-beta-D- thiogalactoside (IPTG) 
for one hour to allow expression of the lacZ region, 
including the CAT gene-cassette RNA transcript. 

At the end of this time, RNA was isolated by 
incubating the cells in 50 mM Tris, pH 8.0, 50mM 
ethylenediaminetetraacetic acid (EDTA) , 1 mg/ml lysozyme 
at room temperature for 10 minutes to lyse the cells. 
The lysate was made to 0.5% sodium dodecyl sulfate (SDS) , 
and then centrifuged to remove cell debris. Phenol was 
added to the supernatant at a ratio of 1:1, and the 
supernatant was centrifuged to remove the precipitate. 
This procedure was repeated, and the resulting aqueous 
phase was treated with an equal volume of isopropanol 
at -20°C for 20 minutes to precipitate the RNA. The 
precipitate was collected by centrifugation, dried, and 
redissolved in water. 

This isolated RNA was electrophoresed on 1.2% 
agarose gels containing formaldehyde as described in 
Current Protocols In Molecular Biology (Greene 19 89). 
After electrophoresis, Northern blots were carried out 
using published methods ( GeneScreen Plus . DuPont, July 
19 85) . Two DNA probes were used for blotting the gels. 
The CAT probe was prepared by primer extension of the CAT 
gene using the Klenow fragment of DNA polymerase I and 
dATP labelled with alpha P 32 . The "hairpin" autocatalytic 
cassette probe was prepared by kinasing the DNA com- 
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plement to the entire "hairpin" autocatalytic cassette 
RNA sequence shown in Figure 20 with dATP labelled with 
gamma P 32 . The expected positions of binding of these 
probes to the CAT- cassette RNA and expected 5' fragment 
are shown in Figure 22 A. 

The results of the Northern blot test are shown 
in Figure 22B. As shown there, when the CAT probe was 
used, the full length CAT- cassette RNA transcript and the 
expected 5' fragment were detected on the gel, indicating 
that cleavage had taken. place in vivo . 

When the "hairpin" autocatalytic cassette probe 
was used, only the full length CAT- cassette RNA tran- 
script was detected (see Figure 22B) . The fact that the 
5' fragment did not hybridize with this probe was to be 
expected, since most of the "hairpin" autocatalytic 
cassette transcript would be in the 3' fragment after 
cleavage. Although it would be expected that the 
"hairpin" autocatalytic cassette probe would hybridize to 
the 3' fragment, the fact that the 3' fragment was not 
detected by Northern blot analysis is not surprising. 
The 5' terminus of the 3' fragment would contain a 5' -OH 
and not the 5'-ppp which is ordinarily seen in RNA tran- 
scripts. Thus, the 3' fragment would be expected to be 
very labile in vivo and was likely degraded immediately 
after the autocatalytic cleavage. 

The mobilities of the CAT- cassette RNA and the 
Cleaved CAT- cassette 5' fragment on the gel corresponded 
exactly to the predicted length of the transcripts. The 
standards run on the gel were E. coli 16S and 23S RNA. 

An important control was to determine whether 
cleavage of the target RNA occurred during isolation of 
the RNA. Intact, uncleaved "hairpin" autocatalytic RNA 
prepared and separated on an acrylamide gel as described 
in Example 23 was isolated from the gel using conditions 
similar to those described above ( i.e. . no divalent 
cation, but in the presence of EDTA and SDS) . Only 
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intact, uncleaved RNA was obtained when the catalyst was 
re-electrophoresed on an acrylamide gel. 

Example 27 

Mammalian expression vector pMHC-CAT containing 
the "hairpin" autocatalytic cassette linked to the CAT 
gene was constructed as follows. First, the "hairpin" 
autocatalytic cassette was removed from the vector pHC 
(prepared as described in Example 23) by digesting pHC 
with Smal/Sall and ligating the resulting fragment to the 
mammalian vector pMSG (purchased from Pharmacia) at the 
Smal/Xhol sites to give pMHC. This operation removed the 
Xhol site used to clone the "hairpin" autocatalytic 
cassette insert, but introduced another Xhol site near 
the 5 'end of the insert. 

Next, the CAT gene was removed from pMAM-NEO- 
CAT (Clontech, Palo Alto, CA) with Smal/Xhol and cloned 
into pMHC at the Smal/Xhol sites to give the final vector 
pMHC-CAT which is shown in Figure 23. Note that the CAT 
gene and the "hairpin" autocatalytic cassette are both 
driven by the MMTV LTR promoter which is dexamethasone 
inducible . 

The vector pMHC-CAT was co- transf ected into 
Chinese hamster ovary cells (CHO) by the calcium 
phosphate method along with the vector pMSG-dhfr (a gift 
from Dr. Ed Prochownick of the University of Michigan, 
Ann Arbor, MI) which contained a mutant dhfr gene- The 
isolation of this mutant dhfr gene is described in 
Simonson and Levinson, Proc. Nat 'l Acad. Sci. 2495-99 
(1983) , and pMSG is available from Pharmacia (see above) . 

Transf ectants were selected using gpt medium 
( Molec. & Cell. Biol. . 3, 1421-29 (1983)) which kills 
non- transf ected cells. They were next amplified with 
methotrexate (which amplifies the mutant dhfr gene and 
associated DNA) to give CHO cells which contained the 
"hairpin" autocatalytic cassette linked to the CAT gene 
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stably integrated into the genome. Dexamethasone (luM) 
was added to induce expression of the CAT gene and 
"hairpin" autocatalytic cassette, and RNA was isolated 
from the individual cells by the guanidinium isothio- 
cyanate procedure (Current Protocols in Molecular 
Biology) . 

Next, a nuclease protection assay was performed 
to identify the nature of the transcripts and cleavage 
products, if any, produced by transcription of the 
"hairpin" autocatalytic cassette-GAT gene fusion. The 
probe used in the SI nuclease assay was prepared from the 
vector pHC19R. This vector was prepared by removing the 
"hairpin" autocatalytic cassette from pHC with BamHI/Sall 
and ligating this fragment into the BamHI/Sall sites of 
the vector pTZl9R (purchased from US Biochemical) as 
shown in Figure 24. Vector pHC19R was linearized with 
BamHI, transcribed with T7 RNA polymerase using P 32 -CTP as 
described in Example 2, and the transcript was isolated 
from 7M urea gels, also as described in Example 2. This 
RNA probe is 148nt long and is the complement to the 
"hairpin" autocatalytic cassette RNA. 

The probe was then hybridized in buffer (40mM 
Pipes, pH6, lOOmM NaCl, 20% formamide, ImM ZnS0 4 ) to RNA 
(5ug) isolated from CHO cells that had been transfected 
with vector pMHC-CAT, and the hybridized R2*A was digested 
with SI nuclease (100 units, 1 hr, , 37°C) . SI nuclease 
will digest all single- stranded RNA, but not double- 
stranded RNA. A negative control was RNA from trans- 
fected CHO cells which had not been induced* A positive 
control was RNA isolated from the pHC plasrnid which had 
been transcribed in vitro with T7 RNA polymerase as 
described in Example 2. 

The Si digests were electrophoresed on 10% 
acrylamide-7M urea gels. The results are shown in Figure 
25 where: Lane 1 contains the RNA probe alone; Lane 2 
contains the probe digested with SI nuclease; Lane 3 
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contains in vitro transcribed "hairpin" autocatalytic 
cassette RNA, 5'F and 3'F that have been hybridized to 
probe and digested with SI nuclease (positive control) ; 
Lane 4 contains RNA isolated from CHO cells that had been 
transfected with pMHC-CAT but were not induced by 
dexamethasone, hybridized to the probe and digested with 
SI nuclease; and Lane 5 contains RNA from CHO cells that 
had been transfected with pMHC-CAT and induced with 
dexamethasone, hybridized to the probe and digested with 
SI nuclease. All mobilities were as expected, and, as 
can been seen in Lane 5, RNA transcribed from the 
"hairpin" autocatalytic cassette- CAT gene fusion was 
cleaved in vivo into the expected products . 

Another control was uncleaved "hairpin" auto- 
catalytic cassette RNA hybridized to the probe, SI 
nuclease digested and electrophoresed under the same 
conditions. No cleavage products were seen (data not 
shown) . This control shows that cleavage of the 
"hairpin" autocatalytic cassette RNA did not occur as a 
result of the analysis conditions and, therefore, that 
the cleavage products seen in Lane 5 must have been 
produced as a result of in vivo cleavage of the hybrid 
CAT- "hairpin" autocatalytic RNA, 

Example 28 

The following example illustrates in vivo 
activity of the "hairpin" catalytic RNA in plants. The 
"hairpin" autocatalytic cassette (see Example 23) was 
ligated to cauliflower mosaic virus (CMV) In a viral 
vector. Plants were then transformed with the resulting 
vector and, during replication of the virus, the 
"hairpin" autocatalytic cassette RNA cleaved the viral 
RNA intramolecularly. The viral RNA serves as a template 
for viral replication and for attenuation of the virus. 
Since cleavage levels of the "hairpin" autocatalytic 
cassette were about 50-60% in vitro , attenuated viral 
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infection in these plants would be expected if the 
"hairpin" autocatalytic cassette RNA linked to the CMV 
RNA did cleave in vivo in the plants, and this was what 
was observed. 

The constructions tested had the "hairpin" 
autocatalytic cassette in the sense and antisense 
orientation. They are shown in Figure 26. 

These constructions were made by removing the 
"hairpin" autocatalytic cassette from vector pHC 
(prepared as described in Example 23) with Xhol/Sall and 
ligating this fragment into the unique Xhoi site of CMV 
plasmid pCSlOl (a gift from Dr. Art Hunt, University of 
Kentucky) . Plasmid pCSlOl contains the entire CMV 
sequence. The Xhoi site is located in gene II of CMV and 
was chosen because DNA can be cloned into this site 
without subsequently interfering with plant infection. 
Another plasmid identical to pCSlOl, except having a 
pBR322 bacterial replicon, is available from American 
Type Culture Collection, and can be used in its place as 
a source of CMV. The "hairpin" autocatalytic cassette 
could be inserted in either of two orientations (sense or 
antisense) since Xhoi and Sail ends are compatible. 

The resulting constructions were grown in XL-l 
blue E. coli (Stratagene) , and clones of the "hairpin" 
autocatalytic cassette insert were isolated and 
identified by cleaving with Kpnl and by electrophoresing 
the insert on 1% agarose gels. The isolated clones were 
pCS101HC7 and pCS101HC9 which had the "hairpin" auto- 
catalytic cassette in the sense and antisense orienta- 
tion, respectively. These constructions are shown in 
Figure 26. Construction pCS101HC7 was designed so that 
CMV 35S RNA containing the "hairpin" autocatalytic 
cassette would be transcribed from it and cleaved if the 
"hairpin" autocatalytic cassette was active in vivo . 

After being grown in E. coli . the constructions 
were cut with Sail, and were rubbed onto 10 turnip 
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( Brassica campestis ) plants (0.5ug DNA/plant) which were 
three weeks old. Four groups of plants were treated as 
follows: 



A. Control -mock inoculated (no virus) 

B. Virus control - no "hairpin* autocatalytic 
cassette insert - pCSlOl 

C. Virus with the sense "hairpin" auto- 
catalytic cassette - pCS10lHC7 

D. Virus with the antisense "hairpin" auto- 
catalytic cassette - pCS101HC9 

The plants were checked weekly for viral 



symptoms. Within six weeks, plants treated with wild- 
type virus were observed to have viral symptoms, includ- 
ing vein clearing, leaf wrinkling and a yellow mosaic 
pattern on the leaf. In turnips treated with pCSl0lHC7, 
the onset of symptoms was delayed 7-10 days as compared 
to turnips infected with wild- type virus, and the sever- 
ity of the symptoms never reached the levels attained 
with the wild- type virus. The following was observed 
when the plants were nine weeks old: 



Treatment 



Symptoms 



A 



No symptoms 



B 



Viral symptoms, worst case, 
most yellowing (chlorosis) 
and most lesions. 



C 



Viral symptoms, distinct 
chlorosis and lesions, 
but much less than in the 
virus control . 



D 



Viral symptoms, less than 
viral control, but not 
as much yellowing as the 
virus with the sense 
"hairpin" autocatalytic 
cassette. Some plants 
looked uninfected. 
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These results indicate that a "hairpin" 
catalytic RNA according to the invention can cleave viral 
RNA in which it is inserted so that the viral infection 
is attenuated (Treatment C) . Surprisingly, viral attenu- 
ation was also obtained with the "hairpin" autocatalytic 
cassette linked to the viral DNA in the antisense direc- 
tion (Treatment D) . This is believed to be due to the 
deletion or disabling of the viral construct containing 
the antisense catalyst, so that viral replication is 
diminished compared to virus control (Treatment B) . 

Next, RNA was isolated from the leaves of the 
nine -week old plants, a probe was prepared and hybridized 
to the isolated RNA, SI nuclease digestion was carried 
out, and the SI digests separated on 10% acrylamide-7M 
urea gels and analyzed by autoradiography, all as 
described in Example 27. The probe used was P 32 - labelled 
autocatalytic cassette RNA, The results are shown in 
Figure 27A, where the 3'F and 5'F resulting from in vivo 
RNA cleavage were found only in plants infected with 
pCS101HC7 having the "hairpin" autocatalytic cassette 
inserted in the sense direction (see Lane 5) • None of 
the other control plants, including those infected with 
the antisense construct pCS101HC9, showed these cleavage 
products, (see Lanes 3, 4 and 6) . 

RNA isolated from the leaves of the nine -week 
old plants was also subjected to Northern blot analysis. 
Total RNA was prepared by the guanidium thiocyanate 
method (Chirgwin et al., Biochemistry, 18, 5294-99 
(1979)) followed by pelleting through a 5.7 M CsCl step 
gradient. Then, 2.5 ug of total RNA was denatured at 65°C 
for 5 min. with formaldehyde and formamide. The dena- 
tured RNA was then electrophoresed on a 1% agarose/ 
formaldehyde gel, and the RNA was transferred to a 
Duralon-UV nylon filter (Stratagene) in 10X standard 
sodium citrate (SSC) (0.15 M) . After UV crosslinking, 
the filter was prehybridized in 6X SSC and 0.05X BLOTTO 
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(Sambrook, Fritsch and Maniatis, Molecular Cloning 
§ 1.102 (2nd ed. 1989)) for 3 hrs at 68°C. The "hairpin" 
autocatalytic cassette fragment was labeled with P 32 
using the oligolabeling procedure of Feinberg and 
Vogelstein, Anal . Biochem. . 132 . 6-9 (1983) . Then it was 
heated to 100°C for 5 min. and cooled on ice. This probe 
was added to the prehybridization mix, and incubation was 
continued at 68°C overnight- The nylon filter was washed 
once in 2X SSC, 0.1% SDS for 20 min. at room temperature, 
followed by washing with IX SSC, 0.1% SDS for 60 min. at 
68°C and 0.1X SSC, 0.1% SDS for 60 min. at 68°C. The 
filter was then exposed overnight to Kodak X-OMAT film 
using two lightning plus screens. The results are shown 
in Figure 27B. Of interest is the detection of viral RNA 
transcripts of about the size expected before cleavage 
(8 Kb) and after cleavage (6 Kb and 2 Kb) following 
infection with pCS101HC7 (Lane 3). 

Some reproducible size heterogenity was 
observed in the SI fragments and in the Northern blots. 
This may be due to the rapid degradation by plant 
nucleases of these RNA fragments which contain either a 
2' -3' cyclic phosphate or a 5' hydroxyl after ribozyme 
cleavage. The 5' hydroxyl could mimic naturally 
occurring RNA degradation signals. After infecting 
turnips with pCS101HC7, RNAs of the sizes expected before 
and after RNA cleavage were detected, but the amount of 
the 6 Kb fragment detected was somewhat less than the 2 
Kb fragment. This could be due to enhanced stability of 
2Kb transcript which contained a cyclic phosphate over 
the 6 Kb transcript which contained the 5' hydroxyl. 

To determine if the DNA encoding the "hairpin" 
autocatalytic ribozyme was stable in vivo , total DNA was 
isolated from the leaves of mock- inoculated turnips, 
turnips inoculated with wild- type CMV (pCSlOl) and 
turnips inoculated with pCS101HC7. DNA was isolated from 
plants essentially as described in Murray and Thompson, 
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Nucleic Acids Research . £, 4321-25 (1980) . Oligo- 
nucleotide primers homologous to domains 5' and 3' to the 
Xhol site of the parental CMV clone pCSlOl were used as 
primers to amplify DNA sequences between these domains by 
polymerase chain reaction (PCR) . Primer 1 # which in- 
cludes an EcoRl site, hybridizes 63 bases upstream from 
the Xhol site and has the, following sequence: 

5' -GGAATTCACC CGTCAGTTTT TAATACTGC-3' [SEQ ID 1) 

Primer 2 includes a BamHI site and hybridizes 54 bases 
downstream from the Xhol site and has the following 
sequence: 

5 ' - TGGATCCATT CTAGTATTTTG AGCTTCT-3' [SEQ ID 2] 

The primers were synthesized on an Applied Biosystem 391 
PCR -MATE using phosphoramidite chemistry. PCR was 
performed as described by the vendor of TaQ polymerase 
(Perkin- Elmer) . Briefly, PCR conditions were 94°C for 1 
min., 55°C for 2 min. and 72°C for 3 min. for 35 cycles. 
After PCR amplification, the amplified DNA was size 
fractionated on a 2% agarose gel and stained with 
ethidium bromide. 

The results are shown in Figure 27C. When 
total plant DNA from mock- infected turnips was 
fractionated, no bands were observed (Figure 27C, lane 
1) . Total DNA isolated from plants infected with wild- 
type CMV gave an expected band of about 123 bp (Figure 
27C, lane 2) . DNA isolated from plants infected with 
pCS101HC7 gave an expected band of 225 bp (Figure- 27C, 
lane 3) . In the latter two cases, the amplified DNA was 
the same size as those bands amplified using the intact 
plasmids pCSlOl and pCS101HC7 (Figure 27C, lanes 4 and 
5) . These results indicate that at least the majority of 
the DNA coding for the "hairpin" autocatalytic ribpzyme 
was retained intact in the viral genome after infection. 

Finally, protein extracts were prepared from 
plants that were mock- inoculated, inoculated with 
wild- type CMV and inoculated with pCS101HC7, and the 
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levels of CMV coat protein in the extracts determined by 
Western immunoblot. The extracts were prepared by 
homogenizing turnip leaf tissue in an equal volume of 
phosphate buffered saline using a mortar and pestle. 
Samples were boiled for 10 min. and then spun down in a 
microfuge for 10 min. at 4°C. Protein concentrations of 
the supernatants were determined by the method of 
Bradford, Anal. Bipchem. , 72, 248-54 (1976), and the 
extracts were diluted with an equal volume of 4X Laemmli 
sample buffer and denatured at 100°C for 5 min. (Laemmli, 
Nature . 227 . 680-85 (1970)). The samples were next 
electrophoresed on 10% SDS-polyacrylamide gels with 5% 
stacking gel (id.), followed by electrotransf er of the 
proteins to nitrocellulose at 45 volts, 1.25 hours, in 
transfer buffer as described in Towbin et al., Proc. 
Natl. Acad. Sci. USA . 76, 4350-54 (1979). Prestained 
molecular weight markers (BRL) were used to confirm 
protein transfer. Nonspecific antibody binding was 
avoided by using blocking solution (Johnson et al . , Gene 
Anal. Techn .. 1, 3-8 (1984)). All antibody incubations 
were done at 22°C with gentle agitation. The primary 
antibody was rabbit antiserum to CMV coat protein (1:1000 
dilution in blocking solution) provided by Dr. R. 
Shepherd, University of Kentucky. An anti- rabbit IgG 
alkaline phosphatase conjugate (Sigma, St. Louis) was 
used as the secondary antibody (1:1000 dilution in 
blocking solution) . Visual detection of proteins was 
accomplished using BCIP/NBT (Sambrook et al.. Molecular 
Cloning: A Laboratory Manual (2nd ed. Cold Spring Harbor 
Laboratory 1989)). 

The results are presented in Figures 27D and 
27E. No coat protein was observed in extracts from mock- 
inoculated plant tissues one or two months after infec- 
tion (Figures 27D and 27E, lane 1) . In extracts from 
plants inoculated with wild- type virus, coat protein was 
found in significant quantities at one month and two 
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months post infection (Figures 27D and Figure 27E, 
lane 2) . One month after infection with the ribozyme 
construct pCS101HC7, very little coat protein was 
detected (Figure 27D, lane 3}- Two months after 
infection with pCS101HC7, increased levels of coat 
protein approaching those found after wild- type CMV 
infection were detected (Figure 27E, lane 3). 

The combined results of the pathological 
observations and the SI nuclease, Northern blot, PCR 
amplification and Western immunoblot assays provide 
conclusive evidence that the "hairpin" catalytic RNA 
cleaved viral RNA in vivo in plants. 

Example 29 

A highly conserved sequence in HIV-1 viral RNA 
has been discovered which has favorable properties as a 
potential target site for a suitably engineered "hairpin" 
catalytic RNA, The 16 -base sequence is shown in Figure 
28. Cleavage occurs between the two bases found at 
positions 111/112 as counted from the 5' cap site (all 
HIV-l sequences are from the HIV Sequence Data Base, 
prepared and distributed by Gerald Myers et al., Los 
Alamos National Laboratory, Los Alamos, NM, telephone 
(505) 665-0480). 

When various HIV-1 isolates were compared, only 
two isolates showed variations in the sequence, and the 
variations were, in each case, only a single base change 
(see Figure 29) (sequences shown in Figure 29 are from 
the HIV Sequence Data Base) . The conserved target site 
sequence is found in the 5' leader region of all nine 
HIV-1 mRNAs (see Figure 28) . Thus, a catalytic RNA 
engineered to cleave the conserved target sequence should 
cleave all of these mRNAs and, thereby, prevent or reduce 
the spread of the virus. 

A "hairpin" catalytic RNA was designed accord- 
ing to the principles set forth herein to cleave the 
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conserved target sequence. The catalytic RNA sequence, 
designated herein as "RHIV", is shown in Figure 30. RHIV 
was designed so that it would base pair with the target 
sequence in the two regions flanking the CGUC cleavage 
sequence (see Figure 30) . The "hairpin" portion of RHIV 
has the same sequence as the "hairpin" portion of the 
(-)sTRSV catalytic RNA sequence shown in Figure 1. Bases 
3 ' -CUGAGGG-5 ' at the 5' end of RHIV are vector bases and 
are not necessary for activity of the catalyst. RHIV was 
synthesized as described in Example 2. 

A substrate RNA containing the conserved 16- 
base target site sequence shown in Figure 28 plus 
additional GCG vector bases at its 5' end was also 
synthesized as described in Example 2. This substrate 
RNA is designated herein as "SHIV" . The sequence of SHIV 
is shown in Figure 30. 

The substrate RNA SHIV and catalytic RNA RHIV 
were then reacted under standard conditions as described 
in Example 3 over a period of time from 0 to 280 minutes. 
The concentration of substrate SHIV was 0.025 uM, and the 
concentration of RHIV was 0.005 uM. Cleavage of sub- 
strate SHIV (which contains the conserved HIV-l sequence) 
by the engineered "hairpin" catalytic RNA RHIV was 
obtained (isee Figure 31) . 

The kinetics of the reaction between SHIV and 
RHIV were investigated using the methods described in 
Example 5. The results are shown in Figure 32. The time 
of incubation was 5 minutes, and the concentration of 
RHIV was 0.005 uM. The concentration of SHIV was: Lane 
1--0.10 uM; Lane 2- -0.05 uM; Lane 3- -0.025 uM; Lane 4-- 
0.012 uM; Lane 5- -0.006 uM; and Lane 6- -0.025 uM (this is 
the control lane at zero time) . 

The rate of cleavage (turnover number, kcat) 
was determined to be 1.6 /minute (see Figure 32). This is 
a very rapid rate when compared to any other catalytic 
RNA under the mild conditions of temperature, pH and salt 
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concentration used- The Km was determined to be 100 nM 
(see Figure 32) , which is also very small for an RNA 
catalyst under these conditions. The low Km indicates 
that the reaction proceeds very well at extremely low 
concentrations of substrate- 

Next, the ability of the catalytic RNA RHIV to 
cleave a long HIV-l transcript was tested . The RNA used 
as a target sequence was a 183nt-long transcript contain- 
ing 107 nt of HIV-l sequence. The target transcript was 
made by T7 RNA polymerase transcription (as described in 
Example 2) of the Haell- linearized plasmid pROS. This 
plasmid was obtained from Dr. John Rossi of the City of 
Hope Medical Center, Duarte, California. Plasmid pROS 
has the S'-Hindlll fragment of the HIVHXB2 isolate cloned 
into plasmid pBluescript II (Stratagene, LaJolla, CA) . 
The S'-Hindlll fragment of HIVHXB2 contains the sequence 
(+)77 to (+)635 from the 5' cap site (sequence obtained 
from the HIV Sequence Data Base) . The 183nt transcript 
contains 76nt of vector sequence at the 5' end, followed 
by I07nt of HIV-l sequence (the sequence from ( + )77 to 
(+)183 from the 5' cap site). 

This I83nt transcript (5nM) was incubated with 
catalytic RNA RHIV (25nM) under standard conditions for 
60 minutes as described in Example 3. When the products 
were analyzed on 6% acrylamide-7M urea gels, cleavage was 
seen to have occurred (see Figure 33) . All mobilities in 
Figure 33 were as expected. This result shows that RHIV, 
a "hairpin" catalytic RNA according to the invention, can 
cleave a long HIV-l RNA transcript in vitro under condi- 
tions near physiological for mammalian cells. 

Example 30 

As a prerequisite for in vivo experiments, it 
was necessary to develop a system for the proper 
termination of the "hairpin" catalytic RNA RHIV when it 
was transcribed in vivo . Such a system was developed by 
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cloning DNA coding for RHIV into the BamHI/MluI sites of 
the vector pHC (prepared as described in Example 23) to 
give plasmid pHR (see Figure 34) . In plasmid pHR, DNA 
coding for RHIV is 5' to DNA coding for the autocatalytic 
cassette RNA (see Figure 20) . 

Plasmid pHR was transcribed with T7 RNA 
polymerase and the resulting lOlnt RNA (see Figure 34) 
isolated, all as described in Example 2. The catalytic 
activity of this lOlnt RNA transcript (designated herein 
as "PRHIV") was assayed against the substrate SHIV (see 
Figure 30) as described in Example 3. The concentration 
of catalytic RNA (RHIV or PRHIV) was 0,025 uM, and the 
concentration of SHIV was 0.10 uM. The lOlnt PRHIV had 
the same catalytic activity as the shorter RHIV (see 
Figure 35 where R designates RHIV and PR designates 
PRHIV), indicating that the extra sequence at the 3' and 
5' ends (see Figure 34) did not interfere with cleavage 
of the substrate. 

Next, a mammalian expression vector was 
prepared by removing DNA coding for RHIV and the auto- 
catalytic cassette RNA from pHR with Smal/Sall (see 
Figure 34) . The resulting fragment was cloned into the 
mammalian vector pMSG (see Example 27) cut with Smal/Sall 
to give pMSGRHIV (see Figure 36) . In pMSGRHIV, the 
catalytic RNA is driven by the MMTV promoter which is 
dexamethasone inducible . 

Vector pMSGRHIV has been used to transf ect 
human cells which were then infected with HIV-l. 
Preliminary results show that the amount of tat and gag 
mRNAs produced in transfected cells was lower than the 
amount produced in control cells. 

Example 31 

In this example, evidence is presented showing 
the in vivo down regulation of two genes in mammalian 
cells by using "hairpin" RNA catalysts in trans . The two 
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genes studied were hypoxanthine guanosine phosphoribosyl 
transferase (HGPRT) and chloramphenicol acetyl 
transferase (CAT) . 

A. In Vivo Inh ibition of HGPRT 

A 15nt target sequence in the HGPRT gene of the 
hamster was identified. The sequence is: 

AUUCC*GUCAUGGCGA [SEQ ID 3] 

A "hairpin" catalytic RNA was designed accord- 
ing to the principles set forth herein to cleave this 
target sequence- The catalytic RNA sequence, designated 
"RHGPT, " is shown in Figure 37. RHGPT base pairs with 
the target sequence in the two regions flanking the CGUC 
cleavage sequence, and the "hairpin" portion of RHGPT has 
the same sequence as the "hairpin" portion of the 
(-)sTRSV catalytic RNA sequence shown in Figure 1. Addi- 
tional vector bases coding for restriction sites are 
present at the 3' and 5' ends of RHGPT. 

RHGPT and the target sequence were synthesized 
as described in Example 2 and tested in vitro as de- 
scribed in Example 3. The results of the test showed 
that the target sequence was cleaved in vitro by the 
engineered "hairpin" ribozyme RHGPT at the * in the 
sequence given above (which is after nt87 in the sequence 
of the HGPRT gene) . 

Next, DNA coding for ribozyme RHGPT was cloned 
into the BamHI/MluI sites of the mammalian expression 
vector pHC (prepared as described in Example 23) . The 
DNA coding for RHGPT and HC was excised from the result- 
ing plasmid with Smal/Sall and cloned into Smal/Sall 
digested pMSG (Example 27) to give plasmid pMRHPT, a map 
of which is shown in Figure 37. The DNA coding for RHGPT 
is located downstream of the dexamethasone- inducible MMTV 
promoter on pMRHPT and upstream of the "hairpin" auto- 
catalytic cassette (HC) which is included bo that RHGPT 
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will be properly terminated at the 3' end after the RNA 
is transcribed. 

Another plasmid, pMR2HPT, was also prepared. 
This plasmid is identical to pMRHPT, except that a 
G35-->C mutation in RHGPT was made (numbering according 
to Figure 42D) . Accordingly, the resulting ribozyme 
produced by this plasmid would be inactive (see 
Example 22) . 

Chinese hamster ovary cells (CHO) were co- 
transfected with one of these plasmids and plasmid pMSG- 
dhfr. The method of transfection and plasmid pMSG-dhfr 
are described in Example 27. Transf ectants were selected 
and amplified with methotrexate, also as described in 
Example 27. A schematic drawing of the selection scheme 
is presented in Figure 38. 

The poison 8-azaquanine, which is allowed into 
cells by the HGPRT enzyme, was then added to the cells 
(80/jg/ml) both in the presence and absence of dexa- 
methasone (ljiM) which induces the HGPRT ribozyme. In the 
absence of dexamethasone, cells with or without DNA 
coding for RHGPT died as expected. In the presence of 
dexamethasone, control cells died, but colonies of 
survivors were seen for cells transf ected with pMRHPT. 
All cells transformed with pMR2HPT (coding for the 
inactive ribozyme) died. 

To quantitate the rate of survival, CHO cells 
were grown in gpt medium (see Example 27) , except un- 
transfected, uninduced CHO cells which were grown in MEM. 
The concentration of 8-azaquanine used was 80^g/ml, and 
the concentration of dexamethasone used was 1/iM. One 
hundred cells were plated per dish. Colonies of cells 
were stained with crystal violet. The results are 
presented below: 
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Cells 



Percent S\irvival 



Untransf ected, 
uninduced 



100%* 



Transfected with 
pMRHPT and induced 



33% 



Transfected with 

pMRHPT but uninduced 



0% 



Transfected with 



0% 



pMR2HPT and induced 

* Results were normalized with untransf ected, 

uninduced cells set at 100%. Actual survival rate 
for these cells was 80%. 

The results are consistent with reduced levels 
of HGPRT due to the cleavage of HGPRT mRNA by the 
engineered "hairpin" ribozyme RHGPT after induction with 
dexamethasone . The in vivo activity of the ribozyme is 
not likely due to antisense effects, since the disabled 
ribozyme coded for by pMR2HPT was ineffective in increas- 
ing resistance to 8-azaquanine. Note that the mutation 
in pMR2HPT is not in the area of the ribozyme that base 
pairs to the substrate, so binding to the substrate 
should occur. 

Next, an Si nuclease assay was performed to 
observe the in vivo levels of mRNA coding for HGPRT. The 
SI nuclease assay was performed as described in 
Example 27. The 148 nt probe which was used hybridizes 
to HGPRT mRNA and was prepared by transcribing plasmid 
pHPTPr as described in Example 27 using P 32 - labelled CTP. 
Plasmid pHPTPr contains the antisense HGPRT sequence from 
nt 20-160 (Konecki et al., Nuqleic Acid? Res. , 10, 
6763-75 (1982)) bridging the cleavage sequence at nt 87 
cloned into pT218R (US Biochemical) between the EcoRI and 
Hindlll sites. 

The results are shown in Figure 39. In Fig- 
ure 39, lane 3 contains RNA from cells transfected with 
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pMRHPT and pMSG-dhfr but not induced, and lane 4 contains 
RNA from cells transfected with pMRHPT and pMSG-dhfr 
which were induced with dexamethasone . A 30% reduction 
in the level of mRNA was observed when the "hairpin" 
ribozyme was induced (compare lanes 3 and 4) . This shows 
that the engineered "hairpin" ribozyme RHGPT reduces 
HGPRT activity by lowering the amount of mRNA, and the 
likely mechanism is cleavage of the HGPRT mRNA. 

B. In Vivo Lowering of CAT mRNA Levels. 

From a series of j.n vitro experiments using the 
techniques described in Examples 2 and 3, the optimum 
target sequence in the CAT gene was determined to be: 

5' -UUUCA*GUCAGUUGCUCAA-3' [SEQ ID 5] 

with cleavage at (*) , which is nt 320 of the CAT gene. 
The "hairpin" catalytic RNA designed to cleave this 
target sequence, designated "RCAT, " is shown in Fig- 
ure 40. RCAT was designed according to the principles 
set forth herein. It base pairs with the target sequence 
in the two regions flanking the AGUC cleavage sequence, 
and the "hairpin" portion of RCAT has the same sequence 
as the "hairpin" portion of the (-)sTRSV catalytic RNA 
sequence shown in Figure 1. Additional vector bases 
coding for restriction sites are present at the 3' and 5' 
ends of RCAT. 

DNA coding for RCAT was ligated to pHC (Example 
23) which had been cut with BamHI/MluI. The resulting 
plasmid was cut with Smal/Sall to remove the fragment 
coding for RCAT and HC. This fragment was ligated to 
pMSG (Example 27) which had been cut with Smal/Sall to 
give pMSGRCAT. Next pCAT (Promega) was cut with Pvul/ 
PstI, and the fragment containing the CAT gene under the 
control of the SV40 promoter and enhancer was isolated on 
a 1% low-melting agarose gel (Nusieve) . Then the frag- 
ment was blunt ended with Klenow fragment and ligated 
into the EcoRI-cut, blunt -ended pMSGRCAT to give the 
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final plasmid pMCATRCAT {see Figure 40) . This plasmid 
contains DNA coding for RCAT under control of the 
dexamethasone- inducible MMTV promoter and the CAT gene 
under control of the SV40 promoter* Further, DNA coding 
for the autocatalytic "hairpin" ribozyme is located 
downstream of DNA coding for RCAT so that RCAT will be 
properly terminated at the 3' end upon transcription of 
the RNA. 

Plasmid pMCATRCAT was cut with Ndel and then 
used to transfect CHO cells along with Xhol-cut pMSG-dhfr 
as described in Example 27. Transf ectants were selected 
and amplified and an SI nuclease assay performed, all as 
described in Example 27. The probe used for the SI 
nuclease assay was RNA transcribed from the plasmid 
pCATP. This plasmid contains the sequence of the CAT 
gene from nt 260-372 (GenBank sequence) bridging the 
cleavage sequence at nt 320 cloned into the EcoRI/Hindlll 
site of pTZ18R (US Biochemical) in the antisense direc- 
tion. The probe was prepared as described in Example 27 
using 32 P-CTP and was a total of 119 nt long. 

The results of the SI nuclease assay are pres- 
ented in Figure 41. A reduction in CAT mRNA was observed 
in cells transf ected with pMCATRCAT and pMSG-dhfr and 
induced with dexamethasone as compared to uninduced 
cells. Thus, the engineered "hairpin" catalyst RCAT did 
reduce the level of CAT mRNA. However, further attempts 
to locate the mRNA cleavage products have so far failed, 
and no lowering of CAT enzymatic activity was seen. 

Example 32 

Additional mutagenesis experiments were per- 
formed changing bases in {-)sTRSV RNA and its substrate. 
All substrate RNAs and catalytic RNAs were prepared as 
described in Example 2. Mutagenesis was carried out 
simply by making the required base change in the 
synthetic DNA template. All catalytic RNAs had addi- 
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tional vector bases 6GG. at the 5' end, and all substrate 
RNAs had additional vector bases GCG at the 5' end. 
These bases are required for efficient transcription 
(Milligan et al., Nucleic Acids Res. . 15 . 8783-98 
(1987)), and the C near the 5' end of all substrates 
ensured at least one P 32 - labelled C in the 5' cleavage 
fragment. The reference sequences were the unmutated 
catalytic and substrate sequences (see Figure 1) . 

Substrate and catalytic RNAs were assayed for 
catalytic activity as described in Example 3. Generally, 
the final concentrations of substrate RNA was O.luM, and 
the final concentration of catalytic RNA was O.OluM. 
Assays were done at 37°C for times ranging from 15-30 
minutes, and a zero- time control was always included. 
Reaction products were analyzed on 15% acrylamide/7M urea 
gels, autoradiography performed, and the bands cut from 
the gels and counted. The control (unmutated ribozyme/ 
substrate) was assayed at the same time as all mutant 
catalytic RNAs and substrates. 

Nucleotide changes made in the native, 
unmutated sequence of both the catalytic RNA and the 
substrate showed a range of catalytic effects. The 
nucleotide changes shown in Figure 42A had very little 
effect on the catalytic activity of the ribozyme (50-100% 
of the activity of the unmutated sequence) , while other 
nucleotide changes (Figure 42B) had an intermediate 
effect on catalytic activity (5-50% of the activity of 
the unmutated sequence) . Those mutational changes 
resulting in very low or no catalytic activity (less than 
5% of the activity of the unmutated sequence) are shown 
in Figure 42 C. The results further define the two- 
dimensional structure of the (-)sTRSV catalytic complex. 
A revised "hairpin" structure for the catalytic complex 
is shown in Figure 42D. In Figures 42A-D, the ribozyme 
is numbered consecutively l-50nt and the substrate l-16nt 
(see Figure 42D) . Also, in Figures 42A-D, upper case 
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letters are used for the ribozyme nucleotides and lower 
case letters are used for the substrate nucleotides. 

In summary, the results show that Helices 2 and 
3 (see Figure 42D) are not continuous but have an un- 
paired base between them. Helix 4 was found to be 
shorter and Loops II and IV were found to be larger than 
previously predicted by computer modeling. Also, certain 
bases in Loops II and IV were found to be invariant. 
Helix 4 can be extended towards the closed end (Loop III) 
of the "hairpin" to give increased stability to the 
ribozyme, and the sequence of Loop III can be mutated 
with retention of catalytic activity. While conventional 
base pairing interactions between Loop I on the ribozyme 
and Loop V of the substrate were not observed, an A-->C 
mutation in ribozyme Loop I partly restored activity to a 
previously inactive c-->a mutation in substrate Loop V, 
indicating that some type of interaction between these 
two bases may be occurring. Finally, the data show that 
the first base pair upstream of the N*GUC cleavage 
sequence cannot be an A:U or U:A, it must be G:C or C:G. 
It is believed that the G:C or C:G base pair is necessary 
for stability of the catalyst -substrate complex and that 
A:U or U:A base pairs can be used if sufficient stability 
is provided by other means such as possibly lengthening 
Helix l. Indeed, it has been found that A:U and U:A base 
pairs can be used at these positions in the synthetic 
autocatalytic catalyst of the invention. Accordingly, in 
engineering a "hairpin" catalyst based on (-)sTRSV, the 
substrate RNA preferably contains the target sequence 
5' -SN*GUC-3' , where S is G or C and cleavage occurs at 
the *. 

The results will now be discussed in detail. 
First, Table I lists all substrate sequences successfully 
cleaved by a catalytic RNA designed according to the 
"hairpin" model so that the bases flanking the N*GUC 
cleavage sequence in the substrate were base paired to 
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the catalyst. Cleavage occurred at the *. The lower 
case letters in Table I designate additional vector 
sequences . 

TABLE 1 





Helix2 


LoopV 


Helixl 








1 


gcg 


UGAC 


A*GUC 


CUGUUU 


[SEQ 


ID 


8) 


2- 


gcg 


UGAC 


A*GUC 


CUGUUUUUUU 


[SEQ 


ID 


9] 


3 


gcg 


UGAC 


A*GUC 


CUGUUUUUUUCGC 


[SEQ 


ID 


10] 


4 


gcg 


UGUC 


A*GUC 


CUGUUU 


[SEQ 


ID 


11] 


5 


gcg 


UGAG 


A*GUC 


CUGUUU 


[SEQ 


ID 


12] 


6 


g 


AAAC 


A*GUC 


CCCAAC 


[SEQ 


ID 


13] 


7 


g 


UUUC 


A*GUC 


AGUUGC 


[SEQ 


ID 


14] 


8 


gcg 


UUUC 


A*GUC 


AGUUGCUCAA 


[SEQ 


ID 


15] 


9 


gcg 


CCCC 


U*GUC 


CCCGAG 


[SEQ 


ID 


16] 


10 


gcg 


UGGG 


U*GUC 


GACAUAgc 


[SEQ 


ID 


17] 


11 


gcg 


UGAC 


A*GUC 


GUGUUU 


[SEQ 


ID 


18] 


12 


gcg 


UGAC 


A*GUC 


AUGUUU 


[SEQ 


ID 


19] 


13 


gcg 


AGAG 


C*GUC 


GGUAUUAA 


[SEQ 


ID 


20] 


14 


gcg 


AGAG 


C*GUC 


GGUAUUAAGCGG 


[SEQ 


ID 


21] 


15 


gcg 


AGAG 


C*GUC 


GGUAUUAAGC 


[SEQ 


ID 


22] 


16 


gcg 


UUUC 


U*GUC 


GUUUAACU 


[SEQ 


ID 


23] 


17 


gcg 


UGAC 


U*GUC 


CUGUUU 


[SEQ 


ID 


24] 


18 


gcg 


UGAC 


C*GUC 


CUGUUU 


[SEQ 


ID 


25] 


19 


gcg 


UGAC 


G*GUC 


CUGUUU 


[SEQ 


ID 


26] 


20 


gcg 


UGCC 


C*GUC 


UGUUGUGUGA 


[SEQ 


ID 


27] 


21 


gcg 


UGCC 


C*GUC 


UGUUGUGU 


[SEQ 


ID 


28] 


22 


gcg 


CCAC 


U*GUC 


GAUCGA 


[SEQ 


ID 


29] 


23 


gcg 


CCAC 


U*GUC 


GAUCGAG 


[SEQ 


ID 


30] 


24 


gcg 


iTTni"" 1 

AUUU 


C*GUC 


AUGGCGA 


[SEQ 


ID 


31] 


25 


gcg 


AUUC 


C*GUC 


AUGGC 


[SEQ 


ID 


32] 


26 


gcg 


AUGC 


G*GUC 


ACUCAUUA 


[SEQ 


ID 


33] 


27 


gcg 


AUGC 


G*GUC 


ACUCAU 


[SEQ 


ID 


34] 


28 


gcg 


AUCC 


U*GUC 


CAUUCAA 


[SEQ 


ID 


35] 


29 


gcg 


AUCC 


U*GUC 


CAUUCAAG 


[SEQ 


ID 


36] 


30 


gcg 


UUGG 


U*GUC 


GACCUGAA 


[SEQ 


ID 


37] 


31 


gcg 


ACAG 


C*GUC 


UGCUCC 


[SEQ 


ID 


38] 


32 


gcg 


UUGC 


G*GUC 


GCUACG 


[SEQ 


ID 


39] 


33 


gcg 


UUGC 


G*GUC 


GCUACGUC 


[SEQ 


ID 


40] 


34 


gcg 


UCUC 


A*GUC 


ACUAUG 


[SEQ 


ID 


41] 


35 


gcg 


CACC 


U*GUC 


ACAUAA 


[SEQ 


ID 


42] 


36 


gcg 


CACC 


U*GUC 


ACAUAAUU 


[SEQ 


ID 


43] 


37 


gc 


GUGG 


U*GUC 


UGUGGA 


[SEQ 


ID 


44] 



As shown in Table I, every base pair in Helices 
1 and 2 can be changed to any other base pair, and the 
substrate will be cleaved by the ribozyme, except the 
base pair in Helix 2 adjacent to the N*GUC cleavage 
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sequence (designated by S in Figure 42D) . When base Gil 
in the ribozyme was changed to C, catalytic activity was 
lost (see Figure 42C) . When a second mutation (c4-->g) 
was made in the substrate so that base pairing was 
restored, catalytic activity was also restored. However, 
when catalysts and substrates having A:U base pairs at 
these positions were tested, no catalytic activity was 
observed. A:U base pairs in this position were checked 
for catalytic activity with a variety of substrates and 
corresponding ribozymes, and all of them were found to be 
inactive. Accordingly, this base pair must be G:C or C:G 
and cannot be A:U or U:A unless, as discussed above, 
other measures are taken to stabilize the substrate- 
catalyst complex. 

As already demonstrated in Examples 18 and 21, 
it is possible to adjust the length of Helix 1 to optim- 
ize the rate of cleavage. In particular, when the native 
sequence was extended four base pairs by adding four A:U 
base pairs to the open end of Helix 1, an increased rate 
of activity was seen (Example 21) . However, when three 
additional G:C base pairs were added in the present 
experiments, a large loss of activity occurred. This 
phenomenon was observed for numerous substrates and 
ribozymes. Accordingly, Helix 1 has an optimal length 
for each substrate used. 

A type of "hinge" region, consisting of a 
single A base at position 15, is present between Helices 
2 and 3 of the catalytic RNA (see Figure 42D) . When the 
A15:U49 potential base pair was changed to the 
compensatory base pair U15:A49 by a double mutation, 
activity remained at nearly 100% (Figure 42A) . The 
single U49->A49 mutation, which would lead to an A: A 
mismatch, also had no effect on activity (Figure 42A) , 
showing that no base pair was needed at this position. 
Accordingly, the results show that a base pair does not 
exist between bases A15 and U49 of the ribozyme. 
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Proof of the base pair C17:G47 was obtained 
previously (see Example 22), but the presumptive base 
pair next to it, C16:G48, could not be shown to exist. 
Both the catalyst containing the C:C mismatch and the 
catalyst containing the reverse G:C base pair were 
inactive (Figure 42C) . Since this G and C are opposite 
each other in this position and are adjacent a base pair, 
it is highly likely they are actually base paired as 
well. However, the fact both the mismatch and reverse 
base pair were inactive suggests that the identity of the 
bases must be maintained in this position. Thus, it is 
likely that this base pair exists and must be C16:G48 as 
shown. 

At the end of Helix 3 is the predicted base 
pair G19:C45. The catalyst containing the C:C mismatch 
was inactive (Figure 42C) , and the catalyst containing 
the reverse base pair C19:G45 was active (Figure 42B) , 
showing that a base pair exists in this position. 

Base pairing between A18.-U46 is shown in Fig- 
ure 42D even though mutagenesis was not done. The 
existence of a base pair at this position is likely since 
it would be located between two proven G:C base pairs. 

Helix 4 is a shorter helix than predicted by 
straightforward computer modeling and two-dimensional 
energy minimization (compare Figures 1 and 19 with 
Figure 42D) . This helix was previously shown to exist by 
showing that the base pair C27:G35 actually existed (see 
Example 22). The next base pair downstream, A28:U34, 
also exists. Catalysts containing the single mismatch 
mutations, U34->A or C, were inactive (Figure 42C) , and 
catalysts containing the compensatory double mutations to 
form either U28:A34 or G28:C34 base pairs restored 
activity (Figure 42A) . 

The following results show that an active 
ribozyme is produced when Helix 4 is extended and the 
sequence of Loop III is changed. As shown in Figure 42A, 
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Loop III was replaced with the common and very stable RNA 
hairpin sequence 5' -GGAC(UUCG)GUCC-3' [SEQ ID 45] 
characterized by Tinoco and colleagues (Cheong et al., 
Nature . 146, 680-82 (1990)); Varani et al., Biochem, , 10, 
3280-89 (1991). As a result of this substitution, Helix 
4 was extended by four base pairs and the GUU sequence of 
Loop III was replaced with the sequence UUCG (see Figure 
42A) . The resulting RNA catalyst was active. In fact, 
the activity of this ribozyme was greater than that of 
the unmutated form. Further, the mutant ribozyme was 
more thermal stable. It remained active at 45°C, while 
the unmutated RNA catalyst loses most of its activity at 
this temperature (see Figure 43) . 

It was concluded from this experiment that Loop 
III does not have a conserved or invariant base sequence 
and that Helix 4 can be extended towards loop III by at 
least four base pairs without loss of activity. The four 
additional base pairs in Helix 4 should provide helix 
stabilization of this region. The secondary folding 
energy of Helix 4 and Loop III in the native structure is 
+0.6 Kcal/mole, while that of the catalyst having the 
extended Helix 4 and the Loop III of the sequence UUCG 
was determined to be -11.1 Kcal/mole (methods described 
in Example 6) . Thus, the presence of the Tinoco et al. 
hairpin sequence increases the folding energy by 11.7 
Kcal/mole. 

However, the simple replacement of Loop III 
with the sequence UUCG (see Figure 42C) gives an inactive 
ribozyme. This is believed to occur because the bases 
of Loop III help stabilize Helix 4. 

When Loop III of the native ribozyme is cut 
between U31 and U32, activity is lost (see Figure 42C) . 
A likely explanation for this is that when the loop is 
cut, Helix 4' opens up and catalytic activity is, conse- 
quently, lost. The "cut" ribozyme was prepared by 
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synthesizing the ribozyme in two parts and allowing the 
parts to anneal. 

With the extension of Helix 4 by the Tinoco 
et al. hairpin sequence, the potential base pair C29:G33 
is between the two proven bases of Helix 4 and the four 
base pairs of the Tinoco sequence. Accordingly, it 
seemed likely that this base pair existed. However, both 
the single mutation G33-->C and the double mutation 
C29 :G33-->G29 :C33 were inactive (see Figure 42C) . Either 
a base pair does not exist at this position or the 
identity of the bases must be maintained. This second 
possibility seems the most likely since the ribozyme 
likely needs to be involved in specific three-dimensional 
folding to carry out catalysis, and specific functional 
groups on the bases may be involved in the folding and/or 
the catalysis itself. 

The existence of other base pairs in Helix 4 
could not be shown. The next potential base pair up- 
stream in Helix 4 was C25:G36. The single mutation 
C25->G was inactive (Figure 42C) . The double mutation, 
G25:C36, was also inactive (Figure 42C) # indicating that 
a base pair does not actually exist. Although the base 
pair could exist with the identity of the bases being 
required, this appears unlikely since the G and C are not 
directly opposite each other since there is A between the 
C and the next proven base pair. By the same sort of 
analysis, the next potential base pair upstream, A24:U37, 
was shown not to exist. When the mutation 
A24 :U37->G24 :C37 was made, no activity was seen (Fig- 
ure 42C) , indicating that a base pair does not exist at 
this position. 

Further mutation studies showed that Loops II 
and IV are larger than originally predicted by computer 
modeling and energy minimization (compare Figures 1 and 
42D) . Also, some of the bases in these loops are 
required for activity. 
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As already shown in Example 12, when bases A22, 
A23 and A24 in Loop II were mutated to GUC, a totally 
inactive ribozyme was obtained, indicating that one or 
all of these bases are essential. The single mutation in 
Loop II of C25->G was inactive (see Figure 42C) . The 
double mutation C25->G + G36->C was also inactive (see 

Figure 42C) . 

in Loop IV, the single mutation U39->G was 
active (see Figure 42A) . indicating that this base was 
not essential for catalytic activity and was not involved 
in a base pair. The single mutation A43->U was inactive, 
as was the double mutation A43->U and U37->A. This 
result was significant because the possibility existed 
for a stem to occur in Loop IV: 



36 
G U A 

U 
A 
U 

C A U 
44 



If this stem existed, the A43:U37 alternate base pair of 
U43:A37 should have been active. Since it was not, it 
was concluded that a stem in Loop IV does not exist. 

In Loop V, the loop formed by the cleavage 
sequence in the substrate molecule, the bases guc are 
invariant and are not base paired to the catalyst (see 
Example 25 and Figure 42C) . 

All of the bases of Loop I in the ribozyme, the 
loop opposite Loop V, have been mutated. When base A7 
was changed to a G or C and when base A10 was changed to 
a G, the resulting catalysts were active (Figure 42A) . 
However, when base G8 was changed to a U or C or when 
base A9 was changed to U, inactive ribozymes were 
produced (see Figure 42C) . 

Potential base pairs between Loop V in the 
substrate and the Loop I in the ribozyme were tested. In 
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particular, experiments to determine whether G8 and A9 of 
Loop I were base paired with the corresponding bases in 
substrate Loop V were performed. Mutations tested were 
G8->C:c8->g and A9->U:u7->a. These mutations were in- 
active (see Figure 42C) , showing that, for the bases 
checked, no base pairs existed. 

While the c9-->a mutation in substrate Loop V 
was inactive (Figure 42C) and the A7-->C mutation of 
ribozyme Loop I was fully active (Figure 42A) , the double 
mutation of c9->a and A7->C showed partial activity (see 
Figure 42B) . This suggests that some type of interaction 
may be occurring between base c9 of the substrate and 
base A7 of the ribozyme which are opposite each other in 
the loops. 

A triple mutation comprising this same double 
mutation plus changing base A20->C in the ribozyme showed 
the same level of activity as the double mutation (Figure 
42B) . The triple mutation represents base changes in the 
ArMV sequence proposed to be catalytic (Gerlach and 
Haseloff, Gene , 82 , 43-52 (1989)) and would suggest that 
a triple base interaction might occur between the three 
bases. However, the catalyst containing the triple muta- 
tion gave catalytic activity at the same level as the 
double mutation, arguing against a triple base inter- 
action. The results obtained with the catalyst contain- 
ing the double mutation, however, indicate that an inter- 
action of some type takes place between substrate C9 and 
ribozyme A7. 

As various changes could be made in the above* 
described products and methods without departing from the 
scope of the invention, it is intended that all matter 
contained in the above description or shown in accompany- 
ing drawings shall be interpreted as illustrative and 
shall not be interpreted in a limiting sense. 
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